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such proteins as well as transgenic non-human animals 
comprising the above-described polynucleotide or vec- 
tors are provided. Described are also methods for idem 
tifying and obtaining inhibitors for therapy of disorders 
related to the malfunction of the CYP3A4, CYP3A7 and 
hPXR genes as well as methods of diagnosing the sta- 
tus of-such disorders. Pharmaceutical and diagnostic 
compositions comprising the above-described polynu- 
cleotides, vectors, proteins, antibodies and inhibitors by 
the above-described method are provided. Said compo- 
sitions are particularly useful for diagnosing and treating 
various diseases with drugs that are substrates, inhibi- 
tors or modulators of the CYP3A4, CYP3A7 or hPXR 
gene product. 
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Description 

Field of the invention 

5 [0001] The present invention relates generally to means and methods of diagnosing and treating the phenotypic 
spectrum as well as the overlapping clinical characteristics with several forms of inherited abnormal expression and/ 
or function of the cytochrome P-450 (CYP)3A4 and CYP3A7 and the human pregnane X receptor (hPXR) genes. In 
particular, the present invention relates to polynucleotides of molecular variant CYP3A4, CYP3A7 and hPXR genes 
which, for example, are associated with abnormal drug response or individual predisposition to several common can- 

10 cers caused by environmental carcinogens, and to vectors comprising such polynucleotides. Furthermore, the present 
invention relates to host cells comprising such polynucleotides or vectors and their use for the production of variant 
CYP3A4, CYP3A7 and hPXR proteins. In addition, the present invention relates to variant CYP3A4, CYP3A7 and 
hPXR proteins and antibodies specifically recognizing such proteins. The present invention also concerns transgenic 
non-human animals comprising the above-described polynucleotide or vectors. Moreover, the present invention relates 

15 to methods for identifying and obtaining drug candidates and inhibitors for therapy of disorders related to the malfunction 
of the CYP3A4, CYP3A7 and hPXR genes as well as to methods of diagnosing the status of such disorders. The 
present invention furthermore provides pharmaceutical and diagnostic compositions comprising the above-described 
polynucleotides, vectors, proteins, antibodies, and drugs and inhibitors obtainable by the above-described method. 
Said compositions are particularly useful for diagnosing and treating various diseases with drugs that are substrates, 

20 inhibitors or modulators of the CYP3A4, CYP3A7 and hPXR genes or their product. 

Several documents are cited throughout the text of this specification. Each of the documents cited herein (including 
any manufacturer's specifications, instructions, etc.) are hereby incorporated herein by reference; however, there is 
no admission that any document cited is indeed prior art as to the present invention. ^ 

25 Background of the invention 

[0002] Members of the cytochrome P-450 (CYP) family of hemoproteins metabolise a wide variety of endogenous 
substrates such as steroid hormones, and of xenobiotics including carcinogens, toxins and drugs(f ,2). Of the human 
• CYP proteins, those of the CYP3A subfamily are of a major importance, since collectively, they are by far the most 

30 abundant of all the human CYP isoforms. Moreover, their substrate specificity is extremely broad; accordingly, many . 
structurally diverse compounds are, exclusively or to some extent, substrates for CYP3A proteins. Based on the data 
available it is generally assumed that all CYP3A isoforms have similar substrate spectra; however, limited studies 
indicate the possibility of differences (3). AH CYP3A isoforms are localized in organs of particular importance to drug 
disposition (gastrointestinal tract, kidney and liver). 

35 [0003] At least three functional CYP3A proteins exist in humans. The CYP3A4 monooxygenase is the predominant 
cytochrome P450 in human liver and small bowel. The protein displays a broad substrate specificity and it metabolises 
more than 60% of all drugs that are currently in use, including contraceptive steroids, antidepressants, benzodi- 
azepines, immunosuppressive agents, imidazole antimicotics, and macrolide antibiotics (4,5). In addition, CYP3A4 
plays a major role in the protection from environmental toxins. For example, the protein metabolizes aflatoxin B1 , which 

40 has been implicated in the etiology of liver cancer, which is a major cause of premature death in many areas of Africa 
and Asia. Aflatoxin B1 is a mycotoxin produced by species of Aspergillus, and human exposure results principally from 
the ingestion of stored foodstuffs contaminated with the mold. Carcinogenicity is associated with its conversion to 
8,9-oxide by the hepatic cytochrome P450-dependent monooxygenase system. Forrester et al. (6) found that the rates 
of metabolic activation of aflatoxin B1 were highly correlated with the level of proteins of the CYP3A gene family in the 

45 microsomes. Furthermore, Paolini et al. (7) found significant increases in CYP3A in the lungs of rats treated with high 
doses of beta-carotene. Consequently, it was proposed that correspondingly high levels of CYP3A4 in humans would 
predispose an individual to cancer risk from the bioactivated tobacco-smoke procarcinogens, thus explaining the co- 
carcinogenic effect of beta-carotene in smokers. All this implies that individual variation in the CYP3A4 activity could 
influence the efficacy of a variety of drug therapies as well as the individual predisposition to several major cancers 

so caused by environmental carcinogens. 

[0004] A considerable variation in the CYP3A4 content and catalytic activity has been, indeed, described in the 
general population. For example, the metabolic clearance of the gene substrates exhibits a unimodal distribution with 
up to 20-fold interindividual variability. The activities of the CYP3A4 protein in liver biopsies vary up to 30-fold (8). 
Furthermore, many common drugs alter the expression levels of the gene (induction or repression) and the extent of 

55 this phenomenon is individually variable. The inducers of CYP3A4 expression include commonly used drugs suchras 
the glucocorticoid dexamethasone, the antibiotic rifampicin, and the anttmycotic clotrimazole. The inducibility of 
CYP3A4 expression, combined with the diverse renge of substrates, creates a potential for potentially harmful drug 
interactions involving this isozyme in patients undergoing therapies with multiple drugs. 
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[0005] CYP3A3 is a very closely related isoform to CYP3A4 (>more than 98% cDNA sequence similarity), but it is 
not known whether this reflects a separate gene product or an allelic variant. By contrast, CYP3A5 is a gene distinct 
from CYP3A4 and it is expressed potymorphically both in the adult and fetal liver and in the kidney and intestine. In 
adult Caucasians, the mRNA and the protein were detected in the liver of 10 to 30% of samples, while the protein was 

5 detected in the kidney and intestine of 70% of subjects (Ref. (9) and references therein). A point mutation described 
in the CYP3A5 gene which possibly results in the synthesis of an unstable protein, may account for the polymorphic 
expression of this enzyme (9). CYP3A7 is the third functional CYP3A isoform. CYP3A7 was originally isolated from a 
fetal liver but it was subsequently found in 54% of adult livers (10). Tests to estimate the inducibility and the activity of 
CYP3A isozymes in an individual patient would be of obvious relevance for the optimization of therapies with drugs 

10 which are their substrates, and for the prevention of the associated side effects. Direct estimates of the activities of 
CYP3A isozymes in liver biopsies are possible but impracticable for both ethical and cost reasons. The indirect in vivo 
tests of CYP3A4 activity such as the erythromycin breath test or the 6-p-hydroxycortisol test pose ethical problems 
such as the invasive admininstration of undesirable probe substances and they are obviously unsuited for routine 
. testing. In addition, the lack of correlation between these tests questions their informative value regarding the CYP3A4 

15 \ activity (11). 

[0006] A major portion (83%) of the interindividual CYP3A4 variability has been attributed to genetic facors (12). The 
establishment of a genetic test for the activity of CYP3A4 and of the other CYP3A isozymes should be possible, as^ 
suming the prior identification of those factors. Genetic variance affecting the activity and the expression of CYP3A 
isozymes could be localized in the genes itself, or in one or more of their regulators. A comparison of the three originally 

20 published sequences of the best characterized CYP3A gene, CYP3A4, suggested the existence of polymorphisms 
■ affecting the amino acid sequence of the CYP3A4 protein (13). Unfortunately, this observation has not been, to our 
knowledge, confirmed in the general population. More recently, a polymorphism (CYP3A4-W) has been described in 
the nifedipine-specific response element of the CYP3A4 promoter (14). Its presence associates with a more advanced 
prostate tumor stage (14). Felix et al. (15) examined this polymorphism in 99 de novo and 30 treatment-related leuke- 

25 mias. In all treatment-related cases, there was prior exposure to one or more anticancer drugs metabolized by CYP3A, 
such as epipodophyllotoxins. These data suggest that individuals with the CYP3A4-W polymorphism may be at in- 
/ creased risk for treatment-related leukemia and that epipodophyllotoxin metabolism by CYP3A4 may contribute to the, 
secondary cancer risk. At present it is unclear if the polymorphism influences the expressivity or inductibility of the 
CYP3A4 protein. A first published analysis suggests that it has no effect on the basal expression level of CYP3A4X8). 

30 A point mutation was described in the CYP3A5 (9), whereas no mutations have been reported in CYP3A7. 

[0007] Experiments with amino acid exchanges artificially introduced into the CYP3A4 gene indicate that the function 
of the family members may be quite sensitive to amino acid exchanges (16-21). Besides amino acid exchanges, silent 
polymorphisms and those localized in untranslated or intronic sequences also could influence the expression level of 
these genes. Alternatively, such polymorphisms could serve as markers for nearby, unidentified polymorphisms. This 

35 ' effect is known as linkage, i.e. defined polymorphisms serve as markers for phenotypes that they are not causative for. 
[0008] . A major breakthrough in the understanding of the CYP3A expression and inducibility took place in 1 998 when 
three research groups independently showed that the expression of CYP3A4 is regulated by a member of the orphan 
nuclear receptor family termed PXR (pregnane X receptor), or PAR (22-24). Upon treatment with inducers of CYP3A4, 
PXR binds to the rifampicin/dexamethasone response element in the CYP3A4 promoter as a heterodirher with the 

40 9-cis retinoic acid receptor (RXR). Northern blot analysis detected most abundant expression of hPXR in liver, colon, 
and small intestine, i.e. in the major organs expressing CYP3A4. The available evidence suggests that human PXR 
serves as a key transcriptional regulator of the CYP3A4 gene. A recent report describes the induction of CYP3A7 
mediated by PXR suggesting that ail members of the family may be regulated by this common transcriptional activator 
(25). 

45 [0009] It is clear that naturally occurring mutations, if they exist can have effects on drug metabolization and efficacy 
of therapies with drugs, in particular in cancer treatment. It is unknown, however, how many of such variations exist, 
and with what frequency and at what positions in the human CYP3A4, CYP3A7 and hPXR genes. 
Accordingly, means and methods for diagnosing and treating a variety of forms of individual drug intolerability and 
. inefficacy of drug therapy which result from CYP3A4, CYP3A7 and/or hPXR gene polymorphisms that interfere e.g.; 

50 with chemotherapeutic treatment of diseases, in particular cancer, was hitherto not available but are nevertheless 
highly desirable. 

[0010] Thus, the technical problem of the present invention is to comply with the needs described above. 
The solution to this technical problem is achieved by providing the embodiments characterized in the claims. 

55 Summary of the Invention 

[001 1] The present invention is based on the finding of novel, so far unknown variations in the nucleotide sequences 
of the CYP3A4, CYP3A7 and hPXR genes and the population distribution of these alleles. Based upon the knowledge 
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of these novel sequences diagnostic tests and reagents for such tests were designed for the specific detection and 
genotyping of CYP3A4, CYP3A7 and hPXR alleles in humans, including homozygous as well as heterozygous, frequent 
as well as rare alleles of the CYP3A4, CYP3A7 and hPXR genes. The determination of the CYP3A4, CYP3A7 and/or 
hPXR gene allele status of humans with such tests is useful for the optimization of therapies with the numerous sub- 
5 strates of CYP3A4 and CYP3A7. It may also be useful in the determination of the individual predisposition to several 
common cancers caused by environmental carcinogens. 

Iii a first embodiment, the invention provides polynucleotides of molecular variant CYP3A4, CYP3A7 and hPXR genes 
and embodiments related thereto such as vectors, host cells, variant CYP3A4, CYP3A7 and hPXR proteins and meth- 
ods for producing the same. 

.10 in yet another embodiment, the invention provides methods for identifying and obtaining drug candidates and inhibitors 
of CYP3A4, CYP3A7 and hPXR for therapy of disorders related to acquired drug hypo- or hypersensitivity as well as 

- methods of diagnosing the status of such disorders. 

In a further embodiment, the invention provides pharmaceutical and diagnostic compositions comprising the above- 
■ : described polynucleotides, vectors containing the same, proteins, antibodies thereto, and drugs and inhibitors obtain- 
15 able by the above-described method. 

The pharmaceutical and diagnostic compositions, methods and uses of the invention are useful for the diagnosis and 

- treatment of cancer and other diseases the therapy of which is dependent on drug treatment and tolerance. The novel 
, variant forms of CYP3A4, CYP3A7 and hPXR genes according to the invention provide the potential for the development 

■ ■ of a pharmacodynamic profile of drugs for a given patient. 

20 . , 

Description of the invention 

[0012] The finding and characterization of variations in the CYP3A4, GYP3A7 and hPXR genes, and diagnostic tests 
for the discrimination of different CYP3A4, CYP3A7 and hPXR alleles in human individuals provide a very potent tool 
25 ' for improving the therapy of diseases with drugs that are targets of the CYP3A4 or CYP3A7 gene product, and whose 
metabolization is therefore dependent on CYP3A4 or CYP3A7. The diagnosis of the individual allelic CYP3A4, CYP3A7 
* and hPXR status permits a more focused therapy, e.g., by opening the possibility to apply individual dose regimens of 
. drugs. It may also be useful as prognostic tool for therapy outcome. Furthermore, diagnostic tests to genotype CYP3A4, 
CYP3A7 and hPXR, and novel CYP3A4, CYP3A7 and hPXR variants, will not only improve therapy with established 
30 : .. drugs and help to correlate genotypes with drug activity or side effects. These tests and sequences also provide rea- 
gents for the development of novel inhibitors that specifically modulate the activity of the individual types of CYP3A4, 
CYP3A7 and hPXR. Expression in yeast, for example, of three allelic cDNAs encoding human liver CYP3A4 and meth- 
- - ods for testing the binding properties and catalytic activities of their gene products have been described in (13). 
v [001 3] Thus, the present invention provides a novel way to exploit molecular biology and pharmalogical research for 
35 - drug therapy while bypassing their potential detrimental effects which are due to, expression of variant CYP3A4, 
CYP3A7 and hPXR genes. 

• ■■ [0014] - Accordingly, the invention relates to a polynucleotide selected from the group consisting of: 

. -i (a) polynucleotides comprising a nucleotide sequence encoding the amino acid sequence encodable by a nucle- 
40 1 ■ otide sequence of SEQ ID NO: 86, 94 and/or 98; ' 

• i ■ (b) polynucleotides comprising a nucleotide sequence encoding a molecular variant of the cytochrome CYP3A4 

protein by way of at least one amino acid deletion, addition and/or substitution at an amino acid position corre- 
sponding amino acid residue Gly56 in exon 3 of the CYP3A4 gene; 

(c) polynucleotides comprising a nucleotide sequence encoding a molecular variant of the cytochrome CYP3A7 
45^ ; protein by way of at least one amino acid deletion, addition and/or substitution, preferably at an amino acid position 

*■ corresponding amino acid residue Thr 409 in exon 11 of the CYP3A7 gene; 

(d) polynucleotides comprising a nucleotide sequence of any one of SEQ ID NOS: 86, 87, 90, 91 , 94, 95 or 108; 

(e) polynucleotides comprising a nucleotide sequence of any one of SEQ ID NOS: 98, 99, 102, 103, 106 or 107; and 

(f) polynucleotides comprising a nucleotide sequence encoding a CYP3A4 or CYP3A7 polypeptide or fragment 
50 thereof having an epitope comprising an amino acid sequence encoded by the nucleotide sequence of SEQ ID 

NO: 86 and/or 94. 

[0015] In the context of the present invention the term "molecular variant" CYP3A4, CYP3A7 or hPXR gene or protein 
as used herein means that said CYP3A4, CYP3A7 or hPXR gene or protein differs from the wild type CYP3A4, CYP3A7 
55 or hPXR gene or protein by way of nucleotide substitution(s), addition(s) and/or deletion(s) (Genomic sequences of 
the CYP3A4, CYP3A7 gene are described, for example Bork, J Biol Chem 264 (1 989), 910-9; Hashimoto, Eur J Biochem 
21 8 (1993), 585-95; Beaune, Proc Natl Acad. Sci USA 83 (1986), 8064-8; Malowa, Proc Natl Acad Sci U S A 83 (1 986), 
5311-5; Accession numbers: M14096, J04449, X12387, M18907. The numbering of the polymorpisms refers to the 
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sequence M 14096; forCYP3A7: the reference sequence is described in Komori, J Biochem (Tokyo) 105 (1989), 161-3; 
Accession number: gi4503232 and for the hPXR gene in Bertilsson, Proc Natl Acad Sci U S A. 95 (1998), 12208-13; 
Lehmann, J Clin Invest. 102 (1998), 1016-23; Accession numbers: AF061056, AF084645, AF084644. The numbering 
of the polymorphisms refers to the sequence AF084645). Preferably, said nucleotide substitution(s) result(s) in a cor- 

5 responding change in the amino acid sequence of the CYP3A4, CYP3A7 or hPXR protein. 

[0016] In accordance with the present invention, the mode and population distribution of novel so far unidentified 
genetic variations in the CYP3A4, CYP3A7 and hPXR gene have been analyzed by sequence analysis of relevant 
regions of the human CYP3A4, CYP3A7 and hPXR genes from many different individuals. It is a well known fact that 
genomic DNA of individuals, which harbor the individual genetic makeup of all genes, including CYP3A4, CYP3A7 and 

10 hPXR can easily be purified from individual blood samples. These individual DNAsamples are then used for the analysis 
of the sequence composition of the CYP3A4, CYP3A7 and hPXR gene alleles that are present in the individual which 
provided the blood sample. The sequence analysis was carried out by PCR amplification of relevant regions of the 
CYP3A4, CYP3A7 and hPXR gene, subsequent purification of the PCR products, followed by automated DNA se- 
quencing with established methods (ABI dyeterminator cycle sequencing). 

15 One important parameter that had to be considered in the attempt to determine the individual CYP3A4, CYP3A7 and/ 
or hPXR genotype and identify novel CYP3A4, CYP3A7 or hPXR variants by direct DNA-sequencihg of PCR-products 
from human blood genomic DNA is the fact that each human harbors (usually, with very few abnormal exceptions) two 
gene copies of each autosomal gene (diploidy). Because of that, great care had to be taken in the evaluation of the 
sequences to be able to identify unambiguously not only homozygous sequence variations but also heterozygous 

20 variations. The details of the different steps in the identification and characterization of novel CYP3A4, CYP3A7 and 
hPXR gene polymorphisms (homozygous and heterozygous) are described in the examples below. 
[001 7] The mutations in the CYP3A4, CYP3A7 and hPXR genes detected in accordance with the present invention 
are illustrated in Figure 5 and 6, respectively (indicated by an arrow). The methods of the mutation analysis followed 
standard protocols and are described in detail in the examples. In general such methods to be used in accordance 

25 with the present invention for evaluating the phenotypic spectrum as well as the overlapping clinical characteristics 
with other forms of drug metabolization and altered tolerance to drugs in patients with mutations in the CYP3A4, 1 
CYP3A7 or hPXR gene encompass for example haplotype analysis, single-strand conformation polymorphism analysis 
(SSCA), PCR and direct sequencing. On the basis of thorough clinical characterization of many patients the phenotypes 
can then be correlated to these mutations as well as to mutations that had been described earlier. 

30 ' [001 8] As is evident to the person skilled in the art this new molecular genetic knowledge can now be used to exactly 
characterize the genotype of the index patient where a given drug takes an unusual effect and of his family. 
[0019]: Over the past 20 years, genetic heterogeneity has been increasingly recognized as a significant source of 
variation in drug response. Many scientific communications (Meyer, Ann. Rev. Pharmacol. ToxicoL 37 (1997), 269-296 
and West, J. Clin. Pharmacol. 37 (1997), 635-648) have clearly shown that some drugs work better or may even be 
. 35 highly toxic in some patients than in others and that these variations in patient's responses to drugs can be related to 
molecular basis. This "pharmacogenomic" concept spots correlations between responses to drugs and genetic profiles 
of patient's (Marshall, Nature Biotechnology, 15 (1997), 954-957; Marshall, Nature Biotechnology, 15 (1997), 
1249-1252). 

[0020] ■ In this context of population variability with regard to drug therapy, pharmacogenomics has been proposed 
40 as a tool useful in the identification and selection of patients which can respond to a particular drug without side effects. 
This identification/selection can be based upon molecular diagnosis of genetic polymorphisms by genotyping DNA 
from leukocytes in the blood of patient, for example, and characterization of disease (Bertz, Clin. Pharmacokinet. 32 
(1997), 210-256; Engel, J. Chromatogra. B. Biomed. Appl. 678 (1996), 93-103). For the providers of health care, such 
as health maintenance organizations in the US and government public health services in many European countries; 
45 this pharmacogenomics approach can represent a way of both improving health care and reducing overheads because 
there is a large cost to unnecessary therapies, ineffective drugs and drugs with side effects. 
[0021] The mutations in the variant CYP3A4, CYP3A7 and hPXR genes sometime result in amino acid deletion(s), 
insertion(s) and in particular in substttution(s) either alone or in combination. It is of course also possible to genetically 
engineer such mutations in wild type genes or other mutant forms. Methods for introducing such modifications in the 
50 DNA sequence of CYP3A4, CYP3A7 or hPXR gene are well known to the person skilled in the art; see, e.g., Sambrook, 
Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory (1 989) N.Y. 

In a preferred embodiment of the invention, the above described polynucleotide encodes a variant CYP3A4, CYP3A7 
or hPXR protein or fragment thereof, e.g., comprising one or more epitopes of the amino acid sequence encoded by 
SEQ ID NOS: 86 and 94. 

55 [0022] For the investigation of the nature of the alterations in the amino acid sequence of the CYP3A4, CYP3A7 and 
hPXR proteins computer programs may be used such as BRASMOL that are obtainable from the Internet. Furthermore, 
folding simulations and computer redesign of structural motifs can be performed using other appropriate computer 
programs (Olszewski, Proteins 25 (1996), 286-299; Hoffman, Comput. Appl. Biosci. 11 (1995), 675-679). Computers 
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can be used for the conformational and energetic analysis of detailed protein models (Monge, J. Mol. Biol. 247 (1995), 
995-1012; Renouf, Adv. Exp. Med. Biol. 376 (1995), 37-45). These analysis can be used for the identification of the 
influence of a particular mutation on binding and/or metabotization of drugs. 

[0023] Usually, said amino acid deletion, addition or substitution in the amino acid sequence of the protein encoded 

5 by the polynucleotide of the invention is due to one or more nucleotide substitution, insertion or deletion, or any com- 
binations thereof. Preferably said nucleotide substitution, insertion or deletion results in an amino acid substitution of 
Gly56 to Asp in exon 3 of the CYP3A4 gene and/or Thr409 to Arg in exon 11 of the CYP3A7 gene. 
[0024] The polynucleotide of the invention may further comprise at least one nucleotide and optionally amino acid 
deletion, addition and/or substitution other than those specified hereinabove, for example those described in the prior 

10 art; e.g.,(13). This embodiment of the present invention allows the study of synergistic effects of the mutations in the 
CYP3A4, CYP3A7 or hPXR gene on the pharmalogical profile of drugs in patients who bear such mutant forms of the 
gene or similar mutant forms that can be mimicked by the above described proteins. It is expected that the analysis of 
said synergistic effects provides deeper insights into drug tolerant or sensitive phenotypes of certain forms of cancer 
and other diseases. From said deeper insight the development of diagnostic and pharmaceutical compositions related 

15 to cancer will greatly benefit. 

[0025] Thus, in a preferred embodiment, the present invention relates to polynucleotides of molecular variant 
CYP3A4, CYP3A7 and hPXR genes, wherein the nucleotide deletion, addition and/or substitution result in altered 
expression of the variant CYP3A4, CYP3A7 or hPXR gene compared to the corresponding wild type gene. 
[0026] The polynucleotide of the invention may be, e.g., DNA, cDNA, genomic DNA, RNA or synthetically produced 

20 DNA or RNA or a recombinant^ produced chimeric nucleic acid molecule comprising any of those polynucleotides 
either alone or in combination. Preferably said polynucleotide is part of a vector, particularly plasm ids, cosmids, viruses 
and bacteriophages used conventionally in genetic engineering that comprise a polynucleotide of the invention. Such 
vectors may comprise further genes such as marker genes which allow for the selection of said vector in a suitable 
host cell and under suitable conditions. 

25 [0027] In a further preferred embodiment of the vector of the invention, the polynucleotide of the invention is opera- 
tive^ linked to expression control sequences allowing expression in prokaryotic or eukaryotic cells. Expression of said 
polynucleotide comprises transcription of the polynucleotide, preferably into a translatable mRN A. Regulatory elements 
ensuring expression in eukaryotic cells, preferably mammalian cells, are well known to those skilled in the art. They 
usually comprise regulatory sequences ensuring initiation of transcription and optionally poly-A signals ensuring ter- 

30 mination of transcription and stabilization of the transcript. Additional regulatory elements may include transcriptional 
as well as translational enhancers. Possible regulatory elements permitting expression in prokaryotic host cells com- 
prise, e.g., the /ac, trp or tac promoter in E. co//, and examples for regulatory elements permitting expression in eu- 
karyotic host cells are the AOX1 or GAL1 promoter in yeast or the CMV-, SV40- , RSV-promoter (Rous sarcoma virus), 
CMV-enhancer, SV40-enhancer or a globin intron in mammalian and other animal cells. Beside elements which are 

35 responsible for the initiation of transcription such regulatory elements may also comprise transcription termination 
signals, such as the SV40-poly-A site or the tk-poly-A site, downstream of the polynucleotide. In this context, suitable 
expression vectors are known In the art such as Okayama-Berg cDNA expression vector pcDV1 (Pharmacia), pCDM8, 
pRc/CMV, pcDNAI, pcDNA3 (In-vitrogene), pSPORTI (GIBCO BRL). Preferably, said vector is an expression vector 
and/or a gene transfer or targeting vector. Expression vectors derived from viruses such as retroviruses, vaccinia virus, 

40 adeno-associated virus, herpes viruses, or bovine papilloma virus, may be used for delivery of the polynucleotides or 
vector of the invention into targeted cell population. Methods which are well known to those skilled in the art can be 
used to construct recombinant viral vectors; see, for example, the techniques described in Sambrook, Molecular Cloning 
A Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. and Ausubel, Current Protocols in Molecular Biology, 
Green Publishing Associates and Wiley Interscience, N.Y. (1994). Alternatively, the polynucleotides and vectors of the 

45 invention can be reconstituted into liposomes for delivery to target cells. 

[0028] The present invention furthermore relates to host cells transformed with a polynucleotide or vector of the 
invention. Said host cell may be a prokaryotic or eukaryotic cell; see supra. The polynucleotide or vector of the invention 
which is present in the host cell may either be integrated into the genome of the host cell or it may be maintained 
extrachromosomaily. In this respect, it is also to be understood that the recombinant DNA molecule of the invention 

50 can be used for "gene targeting" and/or "gene replacement", for restoring a mutant gene or for creating a mutant gene 
via homologous recombination; see for example Mouellic, Proc. Natl. Acad. Sci. USA, 87 (1990), 4712-4716; Joyner, 
Gene Targeting, A Practical Approach, Oxford University Press. 

The host cell can be any prokaryotic or eukaryotic cell, such as abacterial, insect, fungal, plant, animal or human cell. 
Preferred fungal cells are, for example, those of the genus Saccharomyces, in particular those of the species S. cer- 
55 evisiae. The term "prokaryotic" is meant to include all bacteria which can be transformed or transfected with a polynu- 
cleotide for the expression of a variant CYP3A4, CYP3A7 or hPXR protein or fragment thereof. Prokaryotic hosts may 
include gram negative as well as gram positive bacteria such as, for example, E. coli, S. typhimurium, Serratia marc- 
escens and Bacillus subtilis. A polynucleotide coding for a mutant form of CYP3A4, CYP3A7 and hPXR variant proteins 



6 



EP 1 088 900 A1 



can be used to transform or transfect the host using any of the techniques commonly known to those of ordinary skill 
in the art. Methods for preparing fused, operably linked genes and expressing them in bacteria or animal cells are well- 
known in the art (Sambrook, supra). The genetic constructs and methods described therein can be utilized for expres- 
sion of variant CYP3A4, CYP3A7 and hPXR proteins in, e.g., prokaryotic hosts. In general, expression vectors con- 

5 taining promoter sequences which facilitate the efficient transcription of the inserted polynucleotide are used in con- 
nection with the host. The expression vector typically contains an origin of replication, a promoter, and a terminator, 
as well as specific genes which are capable of providing phenotypic selection of the transformed cells. The transformed 
prokaryotic hosts can be grown in fermentors and cultured according to techniques known in the art to achieve optimal 
cell growth. The proteins of the invention can then be isolated from the grown medium, cellular lysates, or cellular 

10 membrane fractions. The isolation and purification of the microbially or otherwise expressed polypeptides of the inven- 
tion may be by any conventional means such as, for example, preparative chromatographic separations and immuno- 
logical separations such as those involving the use of monoclonal or polyclonal antibodies. 

[0029] Thus, in a further embodiment the invention relates to a method for the production of variant GYP3A4, CYP3A7 
and hPXR proteins and fragments thereof comprising culturing a host cell as defined above under conditions allowing 

15 the expression of the protein and recovering the produced protein or fragment from the culture. 

[0030] In another embodiment the present invention relates to a method for producing cells capable of expressing 
a variant CYP3A4, CYP3A7 and/or hPXR gene comprising genetically engineering cells with the polynucleotide or with 
the vector of the invention. The cells obtainable by the method of the invention can be used, for example, to test drugs 
according to the methods described in Sambrook, Fritsch, Maniatis (1989). Molecular cloning: a laboratory manual. 

20 Cold Spring Harbour Laboratory press, Cold Spring Harbour; Peyronneau, Eur J Biochem 218 (1993), 355-61; Yama- 
zaki, Carcinogenesis 16 (1995), 2167-2170. Furthermore, the cells can be used to study known drugs and unknown 
derivatives thereof for their ability to complement loss of drug efficacy caused by mutations in the CYP3A4, CYP3A7 
or hPXR gene. For these embodiments the host cells preferably lack a wild type allele, preferably both alleles of the 
CYP3A4, CYP3A7 and/or hPXR gene and/or have at least one mutated from thereof. Alternatively, strong overexpres- 

25 sion of a mutated allele over the normal allele and comparison with a recombinant cell line overexpressing the normal 
allele at a similar level may be used as a screening and analysis system. The cells obtainable by the above-described 
method may also be used for the screening methods referred to herein below. 

[0031] Furthermore, the invention relates to variant CYP3A4, CYP3A7 and hPXR proteins and fragments thereof 
encoded by a polynucleotide according to the invention or obtainable by the above-described methods or from cells 
30 produced by the method described above. In this context it is also understood that the variant CYP3A4, CYP3A7 and 
hPXR proteins according to the invention may be further modified by conventional methods known in the art. By pro- 
viding the variant CYP3A4, CYP3A7 and hPXR proteins according to the present invention it is also possible to deter- 
mine the portions relevant for their biological activity or inhibition of the same. 

[0032] The present invention furthermore relates to antibodies specifically recognizing a variant CYP3A4, CYP3A7 
35 or hPXR protein according to the invention. Advantageously, the antibody specifically recognizes an epitope containing 
one or more amino acid substitution(s) as defined above 

Antibodies against the variant CYP3A4, CYP3A7 or hPXR protein of the invention can be prepared by well known 
methods using a purified protein according to the invention or a (synthetic) fragment derived therefrom as an antigen. 
Monoclonal antibodies can be prepared, for example, by the techniques as originally described in Kohler and Milstein, 

40 Nature 256 (1975), 495, and Galfre, Meth. Enzymol. 73 (1981), 3, which comprise the fusion of mouse myeloma cells 
to spleen cells derived from immunized mammals. The antibodies can be monoclonal antibodies, polyclonal antibodies 
or synthetic antibodies as well as fragments of antibodies, such as Fab, Fv or scFv fragments etc. Furthermore, anti- 
bodies or fragments thereof to the aforementioned polypeptides can be obtained by using methods which are described, 
e.g., in Harlow and Lane "Antibodies, A Laboratory Manual", CSH Press, Cold Spring Harbor, 1988. These antibodies 

45 can be used, for example, for the immunoprecipitation and immunolocalization of the variant CYP3A4, CYP3A7 and 
hPXR proteins of the invention as well as for the monitoring of the presence of such variant CYP3A4, CYP3A7 and 
hPXR proteins, for example, in transgenic organisms, and for the identification of compounds interacting with the pro- 
teins according to the invention. For example, surface plasmon resonance as employed in the BIAcore system can be 
used to increase the efficiency of phage antibodies which bind to an epitope of the protein of the invention (Schier, 

so Human Antibodies Hybridomas 7 (1996), 97-105; Malmborg, J. Immunol. Methods 183 (1995), 7-13). 

[0033] Furthermore, the present invention relates to nucleic acid molecules which represent or comprise the com- 
plementary strand of any of the above described polynucleotides or a part thereof, thus comprising at least one nucle- 
otide difference compared to the corresponding wild type CYP3A4, CYP3A7 and hPXR gene nucleotide sequences 
specified by the above described nucleotide substitutions, deletions and additions. Such a molecule may either be a 

55 deoxyribonucleic acid or a ribonucleic acid. Such molecules comprise, for example, antisense RNA. These molecules 
may furthermore be linked to sequences which when transcribed code for a ribozyme thereby producing a ribozyme 
which specifically cleaves transcripts of polynucleotides according to the invention. 

[0034] Furthermore, the present invention relates to a vector comprising a nucleic acid molecule according to the 
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invention. Examples for such vectors are described above. Preferably, the nucleic acid molecule present in the vector 
is operatively linked to regulatory elements permitting expression in prokaryotic or eukaryotic host cells; see supra. 
[0035] The present invention also relates to a method for the production of a transgenic non-human animal, preferably 
transgenic mouse, comprising introduction of a polynucleotide or vector of the invention into a germ cell, an embryonic 

5 cell, stem cell or an egg or a cell derived therefrom, the non-human animal can be used in accordance with the method 
of the invention described below and may be a non-transgenic healthy animal, or may have a disorder, preferably a 
disorder caused by at least one mutation in the GYP3A4, CYP3A7 and/or hPXR gene. Such transgenic animals are 
well suited for, e.g., pharmacological studies of drugs in connection with variant forms of the above described variant 
CYP3A4, CYP3A7 and hPXR proteins since these proteins or at least their functional domains are conserved between 

10 : i species in higher eukaryotes, particularly in mammals. Production of transgenic embryos and screening of those can 
be performed, e.g., as described by A. L. Joyner Ed., Gene Targeting, A Practical Approach (1993), Oxford University 
Press. The DNA of the embryos can be analyzed using, e.g., Southern blots with an appropriate probe. 
. [0036] The invention also relates to transgenic non-human animals such as transgenic mouse, rats, hamsters, dogs, 
, monkeys, rabbits, pigs, C. elegans and fish such as torpedo fish comprising a polynucleotide or vector of the invention 

15 or obtained by the method described above, preferably wherein said polynucleotide or vector is stably integrated into 
the genome of said non-human animal, preferably such that the presence of said polynucleotide or vector leads to the 
expression of the variant CYP3A4, CYP3A7 and/or hPXR gene of the invention. It may have one or several copies of 
the same or different polynucleotides of the variant CYP3A4, CYP3A7 or hPXR gene. This animal has numerous 
utilities, including as a research model for drug tolerability and therefore, presents a novel and valuable animal in the 

20 development of therapies, treatment, etc. for diseases caused by deficiency or failure of drug metabolization in the 
cell. Accordingly, in this instance, the mammal is preferably a laboratory animal such as a mouse or rat. 
[0037] Preferably, the transgenic non-human animal of the invention further comprises at least one inactivated wild 
< type allele of the CYP3A4, CYP3A7 and/or hPXR gene. This embodiment allows for example the study of the interaction 
of various variant forms of CYP3A4, CYP3A7 and hPXR proteins. It might be also desirable to inactivate CYP3A4, 

25 CYP3A7 and/or hPXR gene expression or function at a certain stage of development and/or life of the transgenic 
animal. This can be achieved by using, for example, tissue specific, developmental and/or cell regulated and/or induc- 
ible promoters which drive the expression of, e.g., an antisense or ribozyme directed against the RNA transcript of the 

- CYP3A4, CYP3A7 or hPXR gene; see also supra. A suitable inducible system is for example tetracycline-regulated 
. gene expression as described, e.g., by Gossen and Bujard (Proc. Natl. Acad. Sci. 89 USA (1992), 5547-5551) and 

30 Gossen et al. (Trends Biotech. 12 (1994), 58-62). Similar, the expression of the variant CYP3A4, CYP3A7 and hPXR 
gene may be controlled by such regulatory elements. 

[0038] With the variant CYP3A4, CYP3A7 and hPXR polynucleotides and proteins and vectors of the invention, it is 
now possible to study in vivo and in vitro the efficiency of drugs in relation to particular mutations in the CYP3A4, 
. CYP3A7 or hPXR gene of a patient and the affected phenotype. Furthermore, the variant CYP3A4, CYP3A7 and hPXR 

35 proteins of the invention can be used to determine the pharmacological profile of drugs and for the identification and 
preparation of further drugs which may be more effective for the treatment of, e.g., cancer, in particular for the amel- 
.. ioration of certain phenotypes caused by the respective mutations such as those described above. 

[0039] Thus, a particular object of the present invention concerns drug/pro-drug selection and formulation of phar- 
maceutical compositions for the treatment of diseases which are amenable to chemotherapy taking into account the 

40 polymorphism of the variant form of the CYP3A4, CYP3A7 or hPXR gene that cosegregates with the affected phenotype 
of the patient to be treated. This allows the safe and economic application of drugs which for example were hitherto 
- considered not appropriate for therapy of, e.g., cancer due to either their side effects in some patients and/or their 
unreliable pharmalogical profile with respect to the same or different phenotype(s) of the disease. The means and 
methods described herein can be used, for example, to improve dosing recommendations and allows the prescriber 

45 to anticipate necessary dose adjustments depending on the considered patient group. 

[0040] In a further embodiment the present invention relates to a method of identifying and obtaining an CYP3A4, 

- CYP3A7 or hPXR inhibitor capable of modulating the activity of a molecular variant of the CYP3A4, CYP3A7 or hPXR 
gene or its gene product comprising the steps of 

50 (a) contacting the variant CYP3A4, CYP3A7 or hPXR protein or a cell expressing a molecular variant gene com- 

prising a polynucleotide of the invention in the presence of components capable of providing a detectable signal 
in response to drug metabolization, with a compound to be screened under conditions to permit CYP3A4 or 
CYP3A7 mediated drug metabolization, and 

(b) detecting the presence or absence of a signal or increase of a signal generated from the metabolized drug, 
55 . wherein the presence or increase of the signal is indicative for a putative inhibitor. 

[0041] The term "compound" in a method of the invention includes a single substance or a plurality of substances 
which may or may not be identical. 
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Said compound(s) may be chemically synthesized or produced via microbial fermentation but can also be comprised 
in, for example, samples, e.g., cell extracts from, e.g., plants, animals or microorganisms. Furthermore, said compounds 
may be known in the art but hitherto not known to be useful as an inhibitor, respectively. The plurality of compounds 
may be, e.g., added to the culture medium or injected into a cell or non-human animal of the invention. 

5 If a sample containing (a) compound(s) is identified in the method of the invention, then it is either possible to isolate 
the compound from the original sample identified as containing the compound, in question or one can further subdivide 
the original sample, for example, if it consists of a plurality of different compounds, so as to reduce the number of 
different substances per sample and repeat the method with the subdivisions of the original sample. 
[0042] It can then be determined whether said sample or compound displays the desired properties, for example, 

10 by the methods described herein or in the literature (e.g. (13) and Lehmann, J Clin Invest 102 (1998), 1016-23). De- 
pending on the complexity of the samples, the steps described above can be performed several times, preferably until 
the sample identified according to the method of the invention only comprises a limited number of or only one substance 
(s). Preferably said sample comprises substances of similar chemical and/or physical properties, and most preferably 
said substances are identical. The methods of the present invention can be easily performed and designed by the 

15 person skilled in the art, for example in accordance with other cell based assays described in the prior art or by using 
and modifying the methods as described herein. Furthermore, the person skilled in the art will readily recognize which 
further compounds and/or enzymes may be used in order to perform the methods of the invention, for example, en- 
zymes, if necessary, that convert a certain compound into the precursor which in turn represents a substrate for the 
CYP3A4 or CYP3A7 protein. Such adaptation of the method of the invention is well within the skill of the person skilled 

20 in the art and can be performed without undue experimentation. 

[0043] Suitable assays which can be employed in accordance with the present invention are described, for example, 
in Hashimoto, Eur J Biochem 218 (1993), 585-95 wherein transfection assays with chimeric CYP3A4 genes in HepG2 
cells are described. Similarly, the variant CYP3A4, CYP3A7 and/or hPXR genes can be expressed or co-expressed 
in HepG2 cells and analyzed for their transcriptional activity and catalytic properties of CYP3A4 or CYP3A7. Such an 

25 assay can also be used for studying the catalytic properties of the CYP3A4 and CYP3A7 op its substrates such as 
steroids (testosterone; progesterone, androstenedione, Cortisol, 17p-oestradiol, 17oc-ethynyloestradiol), antibiotics 
, (erythromycin), immunosuppressive (cyclosporine A), benzodiazepine (midazolam), benzothiazepine derivatives 
(diltiazem, triazolam), and nifedipine. In particular, such tests are useful to add in predicting whether a given drug will 
interact in an individual carrying the respective variant CYP3A4, CYP3A7 and/or hPXR gene. A suitable expression 

30 system which can be employed in accordance with above described methods of the present invention is also described 
in (22). In addition heterologous expression systems such as yeast can be used in order to study the stability, binding 
properties and catalytic activities of the gene products of the variant CYP3A4, CYP3A7 and hPXR genes compared 
to the corresponding wild type gene product. As mentioned before, the molecular variant CYP3A4, CYP3A7 and hPXR 
genes and their gene products, particularly when employed in the above described methods, can be used for pharma- 

35 cological and toxicological studies of the metabolism of drugs. Preferred drugs to be tested in accordance with the 
methods of the present invention comprise those described above and include, but are not limited to nifedipine, eryth- 
romycin, troleandomycin, quinidine, cyclosporin A, 17o>ethynylestradipl, lidocaine, diltiazem, dexamethasone, RU486; 
see also supra. 

Compounds which can be used in accordance with the present invention include peptides, proteins, nucleic acids, 
40 antibodies, small organic compounds, ligands, peptidomimetics, PNAs and the like. Said compounds can also be 
functional derivatives or analogues of known drugs such as from those described above. Methods for the preparation • 
of chemical derivatives and analogues are well known to those skilled in the art and are described in, for example, 
Beilstein, Handbook of Organic Chemistry, .Springer edition New York Inc., 175 Fifth Avenue, New York, N.Y. 10010 
U.S.A. and Organic Synthesis, Wiley, New York, USA. Furthermore, said derivatives and analogues can be tested for 
45 their effects according to methods known in the art or as described. Furthermore, peptide mimetics and/or computer- 
aided design of appropriate drug derivatives and analogues can be used, for example, according to the methods de- 
scribed below. Such analogs comprise molecules having as the basis structure of known CYP3A4 and CYP3A7-sub- 
strates and/or inhibitors and/or modulators; see infra. 

Appropriate computer programs can be used for the identification of interactive sites of a putative inhibitor and the 
so CYP3A4, CYP3A7 or hPXR protein of the invention by computer assistant searches for complementary structural 
motifs (Fassina, Immunomethods 5 (1994), 114-120). Further appropriate computer systems for the computer aided 
design of protein and peptides are described in the prior art, for example, in Berry, Biochem. Soc. Trans. 22 (1994), 
1033-1036; Wodak, Ann. N. Y. Acad. Sci. 501 (1987), 1-13; Pabo, Biochemistry 25 (1986), 5987-5991. The results 
obtained from the above-described computer analysis can be used in combination with the method of the Invention 
55 for, e.g., optimizing known inhibitors. Appropriate peptidomimetics and other inhibitors can also be identified by the 
synthesis of peptidomimetic combinatorial libraries through successive chemical modification and testing the resulting 
compounds, e.g., according to the methods described herein. Methods for the generation and use of peptidomimetic 
combinatorial libraries are described in the prior art, for example in Ostresh, Methods in Enzymology 267 (1996), 
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220-234 and Dorner, Bioorg. Med. Chem. 4 (1996), 709-715. Furthermore, the three-dimensional and/or crystallo- 
graphic structure of inhibitors and the CYP3A4, CYP3A7 or hPXR protein of the invention can be used for the design 
of peptidomimetic drugs {Rose, Biochemistry 35 (1996), 12933-12944; Rutenberg, Bioorg. Med. Chem. 4 (1996), 
1545-1558). 

5 [0044] In summary, the present invention provides methods for identifying and obtaining compounds which can be 
used in specific doses for the treatment of specific forms of diseases, e.g., cancer the chemotherapy of which is com- 
plicated by malfunctions of the CYP3A4, CYP3A7 or hPXR gene often resulting in an altered activity or level of drug 
metabolization or sensitive phenotype. 

[0045] In a preferred embodiment of the method of the invention said cell is a cell of or, obtained by the method of 
10 the invention or is comprised in the above-described transgenic non-human animal. 

[0046] In a further embodiment the present invention relates to a method of identifying and obtaining an CYP3A4, 
CYP3A7 or hPXR inhibitor capable of modulating the activity of a molecular variant of the CYP3A4, CYP3A7 or hPXR 
gene or its gene product comprising the steps of 

15 (a) contacting the variant CYP3A4, CYP3A7 or hPXR protein of the invention with a first molecule known to be 

bound by CYP3A4, CYP3A7 or hPXR protein to form a first complex of said protein and said first molecule; 

(b) contacting said first complex with a compound to be screened; and 

(c) measuring whether said compound displaces said first molecule from said first complex. 

20 [0047] Advantageously, in said method said measuring step comprises measuring the formation of a second complex 
of said protein and said inhibitor candidate. Preferably, said measuring step comprises measuring the amount of said 
first molecule that is not bound to said protein. 

[0048] In a particularly preferred embodiment of the above-described method of said first molecule is nifedipine, 
rifampicine or corticosterone. Furthermore, it is preferred that in the method of the invention said first molecule is 
25 labeled, e.g., with a radioactive or fluorescent label. 

. [0049] In a still further embodiment the present invention relates to a method of diagnosing a disorder related to the* 
presence of a molecular variant CYP3A4, CYP3A7 or hPXR gene or susceptibility to such a disorder comprising 

. (a) determining the presence of a polynucleotide of the invention in a sample from a subject; and/or 
30 (b) determining the presence of a variant form of CYP3A4, CYP3A7 or hPXR protein, for example, with the antibody 

of the invention. 

[0050] In accordance with this embodiment of the present invention, the method of testing the status of a disorder 
or susceptibility to such a disorder can be effected by using a polynucleotide or a nucleic acid molecule of the invention, 

35 e.g., in the form of a Southern or Northern blot or in situ analysis. Said nucleic acid sequence may hybridize to a coding 
region of either of the genes or to a non-coding region, e.g. intron. In the case that a complementary sequence is 
employed in the method of the invention, said nucleic acid molecule can again be used in Northern blots. Additionally, 
said testing can be done in conjunction with an actual blocking, e.g., of the transcription of the gene and thus is expected 
to have therapeutic relevance. Furthermore, a primer or oligonucleotide can also be used for hybridizing to one of the 

40 above-mentioned CYP3A4, CYP3A7 or hPXR genes or corresponding mRNAs. The nucleic acids used for hybridization 
can, of course, be conveniently labeled by incorporating or attaching, e.g., a radioactive or other marker. Such markers 
are well known in the art. The labeling of said nucleic acid molecules can be effected by conventional methods. 
Additionally, the presence or expression of variant CYP3A4, CYP3A7 and hPXR genes can be monitored by using a 
primer pair that specifically hybridizes to either of the corresponding nucleic acid sequences and by carrying out a PCR 

45 reaction according to standard procedures. Specific hybridization of the above mentioned probes or primers preferably 
occurs at stringent hybridization conditions. The term "stringent hybridization conditions" is well known in the art; see, 
for example, Sambrook et al., "Molecular Cloning, 

[0051] A Laboratory Manual" second ed. t CSH Press, Cold Spring Harbor, 1989; "Nucleic Acid Hybridisation, A 
Practical Approach", Hames and Higgins eds., IRL Press, Oxford, 1985. Furthermore, the mRNA, cRNA, cDNA or 

so genomic DNA obtained from the subject may be sequenced to identify mutations which may be characteristic finger- 
prints of mutations in the CYP3A4, CYP3A7 and hPXR gene. The present invention further comprises methods wherein 
such a fingerprint may be generated by RFLPs of DNA or RNA obtained from the subject, optionally the DNA or RNA 
may be amplified prior to analysis, the methods of which are well known in the art. RNA fingerprints may be performed 
by, for example, digesting an RNA sample obtained from the subject with a suitable RNA-Enzyme, for example RNase 

55 T 1 , RNase T 2 or the like or a ribozyme and, for example, electrophoretically separating and detecting the RNA fragments 
as described above. Further modifications of the above-mentioned embodiment of the invention can be easily devised 
by the person skilled in the art, without any undue experimentation from this disclosure; see, e.g., the examples. An 
additional embodiment of the present invention relates to a method wherein said determination is effected by employing 
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an antibody of the invention or fragment thereof. The antibody used in the method of the invention may be labeled with 
detectable tags such as a histidine flags or a biotin molecule. 

[0052] In a preferred embodiment of the present invention, the above described methods comprise PCR, ligase chain 
reaction, restriction digestion, direct sequencing, nucleic acid amplification techniques, hybridization techniques or 

5 immunoassays (Sambrook et al., loc. cit. CSH cloning, Harlow and Lane loc. cit. CSH antibodies). 
[0053] In a preferred embodiment of the method of the present invention said disorder is cancer. 
[0054] In a further embodiment of the above-described method, a further step comprising administering to the subject 
a medicament to abolish or alleviate said variations in the CYP3A4, CYP3A7 or hPXR gene in accordance with all 
applications of the method of the invention allows treatment of a given disease before the onset of clinical symptoms 

10 due to the phenotype response caused by the CYP3A4, CYP3A7 or hPXR gene. 

[0055] In a preferred embodiment of the method of the invention said medicament are chemotherapeutic agents 
such as substrates of CYP3A4: paclitaxen (Eur J Drug Metab Pharmacokinet 23 (1998), 417-24), tamoxifen and 
toremifene (Drug Metab Dispos 27(1999), 681-8; Clin Pharmacol Ther 64 (1998), 648-54; Clin Pharmacol Ther 57 
(1995), 628-35), trofosfamide (Cancer Chemother Pharmacol 44(1999), 327-334), cyclophosphamide and ifosfamide 

15 (Drug Metab Dispos 27 (1999), 655-66; Cancer Res 58 (1998), 4391-401; Br J Clin Pharmacol 40 (1995), 523-30), 
taxotere (Pharmacogenetics 8 (1998), 391-401; Clarke, Clin Pharmacokinet 36 (1999), 99-114). 
[0056] In another preferred embodiment of the above-described methods, said method further comprises introducing 

(i) a functional and expressible wild type CYP3A4, CYP3A7 or hPXR gene or 
20, (jj) a nucleotide acid molecule or vector of the invention into cells. 

[0057] In this context and as used throughout this specification, "functional" CYP3A4, CYP3A7 and hPXR gene 
means a gene wherein the encoded protein having part or all of the primary structural conformation of the wild type 
CYP3A4, CYP3A7 and hPXR protein, i.e. possessing the biological property of metabolizing drugs and controlling the 

25 CYP3A4, CYP3A7 gene, respectively. This embodiment of the present invention is suited for therapy of cancer in 
particular in humans. Detection of the expression of a variant CYP3A4 or CYP3A7 and/or hPXR gene would allow the 
conclusion that said expression is interrelated to the generation or maintenance of a corresponding phenotype of the 
disease. Accordingly, a step would be applied to reduce the expression level to low levels or abolish the same. This ■ 
can be done, for example, by at least partial elimination of the expression of the mutant gene by biological means, for 

30 example, by the use of ribozymes, antisense nucleic acid molecules, intracellular antibodies or the above described 
inhibitors against the variant forms of these CYP3A4, CYP3A7 and/or hPXR proteins. Furthermore, pharmaceutical 
products may be developed that reduce the expression levels of the corresponding mutant proteins and genes. 
[0058] In a further embodiment the invention relates to a method for the production of a pharmaceutical composition 
comprising the steps of any one of the above described methods and synthesizing and/or formulating the compound 

35 identified in step (b) or a derivative or homologue thereof in a pharmaceutically acceptable form. The therapeutically 
useful compounds identified according to the method of the invention may be formulated and administered to a patient 
as discussed above. For uses and therapeutic doses determined to be appropriate by one skilled in the art see infra. 
[0059] Furthermore, the present invention relates to a method for the preparation of a pharmaceutical composition 
comprising the steps of the above : described methods; and formulating a drug or prodrug in the form suitable for 

40 therapeutic application and preventing or ameliorating the disorder of the subject diagnosed in the method of the in- 
vention. 

[0060] Drugs or pro-drugs after their in vivo administration are metabolized in order to be eliminated either by excre- 
tion or by metabolism to one or more active or inactive metabolites (Meyer, J. Pharmacokinet. Biopharm. 24 (1996), 
449-459). Thus, rather than using the actual compound or inhibitor identified and obtained in accordance with the 
45 methods of the present invention a corresponding formulation as a pro-drug can be used which is converted into its 
active in the patient. Precautionary measures that may be taken for the application of pro-drugs and drugs are described 
in the literature; see, for review, Ozama, J. Toxicol. Sci. 21 (1996), 323-329). 

[0061] In a preferred embodiment of the method of the present invention said drug or prodrug is a derivative of a 
medicament as defined hereinbefore. 

so [0062] In a still further embodiment the present invention relates to an inhibitor identified or obtained by the method 
described hereinbefore. Preferably, the inhibitor binds specifically to the variant CYP3A4, CYP3A7 or hPXR protein of 
the invention. The antibodies, nucleic acid molecules and inhibitors of the present invention preferably have a specificity 
at least substantially identical to binding specificity of the natural ligand or binding partner of the CYP3A4, CYP3A7 or 
hPXR protein of the invention. An antibody or inhibitor can have a binding affinity to the CYP3A4, CYP3A7 or hPXR 

55 protein of the invention of at least 10 5 M _1 , preferably higher than 10 7 M" 1 and advantageously up to 10 10 M' 1 in case 
CYP3A4, CYP3A7 or hPXR activity should be repressed. Hence, in a preferred embodiment, a suppressive antibody 
or inhibitor of the invention has an affinity of at least about 1 0* 7 M, preferably at least about 1 0 -9 M and most preferably 
at last about 10" 11 M. 
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[0063] Furthermore, the present invention relates to the use of an oligo- or polynucleotide for the detection of a 
polynucleotide of the invention and/or for genotyping of corresponding individual CYP3A4, CYP3A7 or hPXR alleles. 
Preferably, said oligo- or polynucleotide is a polynucleotide or a nucleic acid molecule of the invention described before. 
[0064] In a particular preferred embodiment said oligonucleotide is about 15 to 50, preferably 20 to 40, more pref- 
5 erably 20 to 30 nucleotides in length and comprises the nucleotide sequence of any one of SEQ ID NOS: 1 to 107 or 
a complementary sequence. 

[0065] Hence, in a still further embodiment, the present invention relates to a primer or probe consisting of an oligo^ 
nucleotide as defined above. In this context, the term "consisting of means that the nucleotide sequence described 
above and employed for the primer or probe of the invention does not have any further nucleotide sequences of the 

10 CYP3A4, CYP3A7 or hPXR gene immediately adjacent at its 5' and/or 3' end. However, other moieties such as labels, 
e.g., biotin molecules, histidin flags, antibody fragments/colloidal gold, etc. as well as nucleotide sequences which do 
not correspond to the CYP3A4, CYP3A7 or hPXR gene may be present in the primer and probes of the present inven- 
tion. Furthermore, it is also possible to use the above described particular nucleotide sequences and to combine them 
with other nucleotide sequences derived from the CYP3A4, CYP3A7 or hPXR gene wherein these additional nucleotide 

15 sequences are interspersed with moieties other than nucleic acids or wherein the nucleic acid does not correspond to 
nucleotide sequences of the CYP3A4, CYP3A7 or hPXR gene. Furthermore, it is evident to the person skilled in the 
art that the oligonucleotide can be modified, for example, by thio-phosphate-backbones and/or base analogs well 
known in the art (Flanagan, Proc. Natl. Acad. Sci. USA 96 (1 999), 351 3-8; Witters, Breast Cancer Res. Treat. 53 (1999), 
41-50; Hawley, Antisense Nucleic Acid Drug Dev. 9 (1999), 61-9; Peng Ho, Brain Res. Mol. Brain Res. 62 (1998), 1-11; 

20 . Spiller, Antisense Nucleic Acid Drug Dev. 8 (1998), 281-93; Zhang, J. Pharmacol. Exp. Ther. 278 (1996), 971-9; Shoji, 
Antimicrob. Agents Chemother. 40 (1996), 1670-5; Crooke, J. Pharmacol. Exp. Ther. 277 (1996), 923-37). 
[0066] In addition, the present invention relates to the use of an antibody or a substance capable of binding specifically 
to the gene product of an,CYP3A4, CYP3A7 or hPXR gene for the detection of the variant CYP3A4, CYP3A7 or hPXR 
protein of the invention, the expression of a molecular variant CYP3A4, CYP3A7 or hPXR gene comprising a polynu- 

25 cleotide of the invention and/or for distinguishing CYP3A4, CYP3A7 and hPXR alleles comprising a polynucleotide of 
the invention. 

[0067] Moreover, the present invention relates to a composition, preferably pharmaceutical composition comprising 
the antibody, the nucleic acid molecule, the vector or the inhibitor of the present invention, and optionally a pharma- 
' ceutically acceptable carrier. These pharmaceutical compositions comprising, e.g., the inhibitor or pharmaceutically 
30 acceptable salts thereof may conveniently be administered by any of the routes conventionally used for drug admin- 
istration, for instance, orally, topically, parenterally or by inhalation. Acceptable salts comprise acetate, methylester, 
HCI, sulfate, chloride and the like. The compounds may be administered in conventional dosage forms prepared by 
combining the drugs with standard pharmaceutical carriers according to conventional procedures. These procedures 
may involve mixing, granulating and compressing or dissolving the ingredients as appropriate to the desired prepara- 
35 tion. It will be appreciated that the form and character of the pharmaceutically acceptable character or diluent is dictated 
by the amount of active ingredient with which it is to be combined, the route of administration and other well-known 
variables. The carrier(s) must be "acceptable" in the sense of being compatible with the other ingredients of the for- 
mulation and not deleterious to the recipient thereof. The pharmaceutical carrier employed may be, for example, either 
a solid or liquid. Exemplary of solid carriers are lactose, terra alba, sucrose, talc, gelatin, agar, pectin, acacia, magne- 
to sium stearate, stearic acid and the like. Exemplary of liquid carriers are phosphate buffered saline solution, syrup, oil 
such as peanut oil and olive oil, water, emulsions, various types of wetting agents, sterile solutions and the like. Similarly, 
the carrier or diluent may include time delay material well known to the art, such as glyceryl monostearate or glyceryl 
distearate alone or with a wax. 

The dosage regimen will be determined by the attending physician and other clinical factors; preferably in accordance 
45 with any one of the above described methods. As is well known in the medical arts, dosages for any one patient depends 
upon many factors, including the patient's size, body surface area, age, the particular compound to be administered, 
sex, time and route of administration, general health, and other drugs being administered concurrently. Progress can 
be monitored by periodic assessment. 

[0068] Furthermore, the use of pharmaceutical compositions which comprise antisense-oligonucleotides which spe- 
50 cifically hybridize to RNA encoding mutated versions of a CYP3A4, CYP3A7 or hPXR gene according to the invention 
or which comprise antibodies specifically recognizing mutated CYP3A4, CYP3A7 or hPXR protein but not or not sub- 
stantially the functional wild-type form is conceivable in cases in which the concentration of the mutated form in the 
cells should be reduced. 

Thanks to the present invention the particular drug selection, dosage regimen and corresponding patients to be treated 
55 can be determined in accordance with the present invention. The dosing recommendations will be indicated in product 
labeling by allowing the prescriber to anticipate dose adjustments depending on the considered patient group, with 
information that avoids prescribing the wrong drug to the wrong patients at the wrong dose. 

[0069] Furthermore, the present invention relates to a diagnostic composition or kit comprising any one of the afore- 
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described polynucleotides, vectors, host cells, variant CYP3A4, CYP3A7 and hPXR proteins, antibodies, inhibitors, 
nucleic acid molecules or the corresponding vectors of the invention, and optionally suitable means for detection. 
The kit of the invention may contain further ingredients such as selection markers and components for selective media 
suitable for the generation of transgenic cells and animals. The kit of the invention may advantageously be used for 

5 carrying out a method of the invention and could be, inter alia, employed in a variety of applications, e.g., in the diag- 
nostic field or as research tool. The parts of the kit of the invention can be packaged individually in vials or in combination 
in containers or multicontainer units. Manufacture of the kit follows preferably standard procedures which are known 
to the person skilled in the art. The kit or diagnostic compositions may be used for methods for detecting expression 
of a mutant form of CYP3A4, CYP3A7 or hPXR gene in accordance with any one of the above-described methods of 

10 the invention, employing, for example, immunoassay techniques such as radioimmunoassay or enzymeimmunoassay 
or preferably nucleic acid hybridization and/or amplification techniques such as those described herein before and in 
the examples. 

[0070] Some genetic changes lead to altered protein conformational states. For example, some variant CYP3A4, 
CYP3A7 or hPXR proteins may possess a tertiary structure that renders them far less capable of facilitating drug 

15 metabolization and transcription initiation, respectively. Restoring the normal or regulated conformation of mutated 
proteins is the most elegant and specific means to correct these molecular defects, although it is difficult. Pharmaco- 
logical manipulations thus may aim at restoration of wild-type conformation of the protein. Thus, the polynucleotides 
and encoded proteins of the present invention may also be used to design and/or identify molecules which are capable 
of activating the wild-type function of a CYP3A4, CYP3A7 or hPXR gene or protein. 

20 [0071] In another embodiment the present invention relates to the use of a drug or prodrug for the preparation of a 
pharmaceutical composition for the treatment or prevention of a disorder diagnosed by the method described herein- 
before. 

Furthermore, the present invention relates to the use of an effective dose of a nucleic acid sequence encoding a 
functional and expressible wild type CYP3A4, CYP3A7 or hPXR protein for the preparation of a pharmaceutical com- 

25 position for treating, preventing and/or delaying a disorder diagnosed by the method of the invention. A gene encoding 
a functional and expressible CYP3A4, CYP3A7 or hPXR protein can be introduced into the cells which in turn produce 
the protein of interest. Gene therapy, which is based on introducing therapeutic genes into cells by ex-vivo or in-vivo 
techniques is one of the most important applications of gene transfer. Suitable vectors and methods for in-vitro or in- 
vivo gene therapy are described in the literature and are known to the person skilled in the art; see, e.g., Giordano, 

30 Nature Medicine 2 (1996), 534-539; Schaper, Circ. Res. 79 (1996), 911-919; Anderson, Science 256 (1992), 808-813; 
Isner, Lancet 348 (1996), 370-374; Muhlhauser, Circ. Res. 77 (1995), 1077-1086; Wang, Nature Medicine 2 (1996), 
714-716; W094/29469; WO 97/00957 or Schaper, Current Opinion in Biotechnology 7 (1996), 635-640, and references 
cited therein. The gene may be designed for direct introduction or for introduction via liposomes, or viral vectors (e.g. 
adenoviral, retroviral) into the cell. Preferably, said cell is a germ line cell, embryonic cell, or egg cell or derived there- 

35 * from, most preferably said cellis a stem cell. 

[0072] As is evident from the above, it is preferred that in the use of the invention the nucleic acid sequence is 
operatively linked to regulatory elements allowing for the expression and/or targeting of the CYP3A4, CYP3A7 or hPXR 
protein to specific cells. Suitable gene delivery systems that can be employed in accordance with the invention may 
include liposomes, receptor-mediated delivery systems, naked DNA, and viral vectors such as herpes viruses, retro- 

40 - viruses, adenoviruses, and adeho-associated viruses, among others. Delivery of nucleic acids to a specific site in the 
body for gene therapy may also be accomplished using a biolistic delivery system, such as that described by Williams 
(Proc. Natl. Acad. Sci. USA 88 (1991 ), 2726-2729). Standard methods for transfecting cells with recombinant DNA are 
well known to those skilled in the art of molecular biology, see, e.g., WO 94/29469; see also supra. Gene therapy may 
be carried out by directly administering the recombinant DNA molecule or vector of the invention to a patient or by 

45 transfecting cells with the polynucleotide or vector of the invention ex vivo and infusing the transfected cells into the 
patient. 

[0073] In a preferred embodiment of the uses and methods of the invention, said disorder is cancer. 

[0074] These and other embodiments are disclosed or are obvious from and encompassed by the description and 

examples of the present invention. Further literature concerning any one of the methods, uses and compounds to be 

50 employed in accordance with the present invention may be retrieved from public libraries, using for example electronic 
devices. For example the public database "Medline" may be utilized which is available on Internet, e.g. under http:// 
www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and addresses, such as http://www.ncbi.nlm.nih.gov/, 
http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html, http://www.tigr.org/, are known to the person 
skilled in the art and can also be obtained using, e.g., http://www.lycos.com. An overview of patent information in 

55 biotechnology and a survey of relevant sources of patent information useful for retrospective searching and for current 
awareness is given in Berks, TIBTECH 12 (1994), 352-364. 

[0075] The pharmaceutical and diagnostic compositions, uses, methods of the invention can be used for the diag- 
nosis and treatment of all kinds of diseases hitherto unknown as being related to or dependent on variant CYP3A4, 
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CYP3A7 and hPXR genes. The compositions, methods and uses of the present invention may be desirably employed 
in humans, although animal treatment is also encompassed by the methods and uses described herein. 

Brief description of the figures 

5 

[0076] 

Figure 1: Differences in the genetic makeup influence the efficacy and safety of drug treatment. 
10 Figure 2: A current model of the regulation of CYP3A4 by hPXR. 



Figure 3: (A) Structure of the CYP3A4 gene as described in Ref. by Hashimoto, Eur Biochem 218 (1993) 585-95 
and confirmed in this study. Coding regions are indicated. as filled rectangles, the non-coding 5' untrans- 
lated region as a dashed rectangle. Arrowheads represent the positions of oligonucleotides used to 
*5 screen the coding region of the gene (see Table 2 for the oligonucleotide sequences). (B) Determination 

of the exon flanking sequences. Double-headed arrows indicate genomic regions amplified by PCR. 
Single-headed arrows indicate sequences obtained by direct sequencing of BAC clones. The sequences 
of oligonucleotides shown in (B) are given in Table 1. 

20 Figure 4: . (A) Structure of the hPXR gene. Coding regions are indicated as filled rectangles, non-coding 5* and 3* 
untranslated regions as dashed rectangles. Arrowheads represent the positions of oligonucleotides used 
to screen the coding region of the gene (see Table 5 for the oligonucleotide sequences). (B) Elucidation 
of the genomic structure of hPXR. Double-headed arrows indicate genomic regions amplified by PCR. 
Oligonucleotides used to isolate an hPXR-containjng BAC clone are shown in bold. Single-headed arrows 

25 indicate sequences obtained by direct sequencing of this clone. (C) Differential expression of hPXR tran- 

scripts in the liver (L) and small intestine (SI) as investigated by PCR amplification of tissue-derived cDNA 
pools. Exon 1b is expressed in both target tissues of the gene whereas exon 1a is expressed only in the 
liver. The sequences of oligonucleotides shown in (B) and (C) are given in Table 3. 

30 Figure 5: Polymorphisms in the CYP3A4 and CYP3A7 genes. Numbering of polymorphic sites within the exons of 
. CYP3A4 is based on the GenBank sequence M 14096. Numbering of polymorphic sites within the exons 
of CYP3A47 is based on the GenBank sequence gi4503232. 

Figure 6: Polymorphisms in the hPXR gene. Numbering of polymorphic sites within the exons of hPXR is based 
on the GenBank sequence AF084645. 

Figure 7: An enzymatic test for the CYP3A7 exon 1 1 C1229G (Thr409Arg) polymorphism. (A) The mutation elim- 
inates the unique Alwnl restriction site- from the exon 11 -containing fragment amplified with oligonucle- 
otides 3A442F and 3A438R (arrowheads). (B) Genotyping of a wild-type (wt/wt) and a heterozygous (wt/ 
C1229G) DNA sample by means of the Alwnl restriction digest. 

Figure 8: Genomic sequences and polymorphisms in CYP3A4, CYP3A7 and hPXR genes. Primers used for the 
amplification and sequencing (Tables 1 and 3), as well as splice sites are underlined. Thick underlined 
are polymorphic sites and they are shown as the wild-type and variant base, separated by an arrow. 



35 



40 



45 , 



50 



55 



[0077] The invention will now be described by reference to the following examples which are merely illustrative and 
are not to be construed as a limitation of the scope of the present invention. 

Examples 

[0078] Genomic samples, isolated by standard techniques from 22 human livers were obtained from Prof. Urs Meyer, 
Biozentrum der Universitat Basel. Liver samples were obtained under consideration of all legal, ethical and medical 
requirement of the Biozentrum der Universitat Basel. Blood samples from 95 individuals were obtained and processed 
by ion exchange chromatography methods (Qiagen) to isolate DNA. Blood samples were obtained under consideration 
of all legal, ethical and medical requirement of Parexel International (Berlin). 
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Example 1 : Genomic organization and oligonucleotides for the amplification of the coding regions of CYP3A4, 
CYP3A7 and hPXR 

[0079] The genomic structure of CYP3A4 has been described in an earlier work (Hashimoto, Eur. J. Siochem. 218 

5 (1993), 585-95); however, the published exon flanking sequences were too short to design oligonucleotides for exon 
amplification. In accordance with the present invention these sequences have been elucidated first by sequencing of 
PCR (Polymerase Chain Reaction)-amplified fragments containing parts of two neighboring exons and the intercalated 
intron. The primers used for the amplification of these fragments were derived from the published cDNA sequence 
(Beaune, Proc. Natl. Acad. Sci. USA 83 (1986), 8064-8, GenBank accession number M 14096) upon consideration of 

10 the exon/intron organization of the gene (Table 1, Fig. 3). For exons flanked by larger introns, CYP3A4-containing 
bacterial artificial chromosome (BAC) clones (Control numbers 22300 and 22301) purchased from Genome Systems 
(St. Louis, MO, USA) were directly sequenced. The clones had been isolated by PCR using the CYP3A4PF and 
CYP3A4PR oligonucleotides (Table 1), which were derived from the promoter region of the gene (GenBank accession 
number D11131, Hashimoto, supra). The sequences thus obtained were used to design oligonucleotides for the am- 

15 plification of the gene exons (Fig. 1 ). Their sequences, the amplification conditions, and the sizes of the resulting DNA 
fragments are given in Table 2. Besides the exon sequences, fragments amplified contain also some flanking intronic 
sequences, including the splice sites. The oligonucleotide 3A425R was derived from the non-coding part of CYP3A4 
cDNA (exon 1 3, Beaune, supra, GenBank accession number M 14096). The sequences of oligonucleotides to determine 
the exon/intron boundary of Exon 11 of CYP3A7 (3A720R and 3A721F) are shown in bold in Table 1. 

20 in accordance with the present invention a similar strategy was applied to elucidate the genomic organization and to 
determine exon-flanking sequences of the CYP3A4 and CYP3A7 regulator, hPXR (Fig. 4A, B, Table 3). The gene 
consists of 10 exons and spans at least 19 kb of genomic DNA. The sequences at exon-intron junctions as well as 
exon and intron sizes are given in Table 4. Exons 1 band 1 a are utilized alternately in the formation of hPXR transcripts. 
This is evidenced by the amplification of liver and intestine cDNA pools (Clontech) with primers derived from exons 

25 1b, 1a, and 2 (Fig. 4C), followed by sequencing. In agreement with this observation, exon 1A lacks a 3* consensus 
splice site. Furthermore, exon 1 B and 1 A sequences correspond to the divergent 5'sequences of the two cDNA hPXR 
clones described by Bertilsson et al (23) (GenBank accession numbers AF084645 and AF084644). The two transcripts 
are differentially expressed in the target tissues of hPXR. Exon 1 b is expressed both in the liver and in the small intestine 
whereas exon 1a is expressed only in the latter tissue (Fig. 1B). 

30 [0080] The sequences thus obtained were used to design oligonucleotides for amplification of the gene exons. Their 
composition and the sizes of the resulting DNA fragments are given in Table 5. Besides the exon sequences, fragments v 
amplified contain also some flanking intronic sequences, including the splice sites, as well as some 5'- and 3-UTR 
(untranslated region) sequences of the gene. 

35 Example 2: Isolation of genomic DNA, amplification) purification and sequencing of CYP3A4, CYP3A7 and 
hPXR gene fragments 

[0081] Genomic DNA was isolated using standard techniques from 22 liver samples and 95 blood samples obtained 
from unrelated Caucasians. Conditions for the amplification of CYP3A4, CYP3A7 and hPXR gene fragments by PCR 
40 . are given in Tables 2 and 5, respectively. The complete sequences of the amplicons are given in Figure 8. The quality 
of amplicons was routinely checked by agarose gel electrophoresis. The fragments were then processed through PCR 
purification columns (Qiagen) which remove all the components of the PCR that could otherwise interfere with the 
subsequent sequencing reaction. ■ ■ > 

[0082] The sequencing reaction was performed using the dye-terminator method and the samples were then resolved 
45 on polyacrylamide gels (Perkin-Elmer 377 and 3700 sequencing machines). Both strands were routinely sequenced 
to assure high accuracy of the results and the detection of heterozygotes. The sequences were visually inspected for 
their quality and then analyzed for the presence of polymorphisms using the PHRED/PHRAP/POLYPHRED/CONSED 
software package (University of Washington, Seattle, USA). 

50 Example 3: Polymorphisms in the CYP3A4 and CYP3A7 gene 

[0083] In accordance with the present invention, 22 or more DNA samples were screened for mutations in exons 3, 
4, 5, 7 and 9 of the CYP3A4 gene, and in exon 11 of the CYP3A7 gene. Three novel mutations were detected in the 
samples screened, two of them result in non-conservative amino acid exchanges (Fig. 5, Table 6, Table 8, Figure 8). 
55 The numbering of nucleotide polymorphisms within the CYP3A4 transcript, and of positions of the resulting amino acid 
exchanges, is based on GenBank sequence M 14096. For CYP3A7 GenBank sequence gi 4503232 was used as ref- 
erence. 



15 



EP 1 088 900 A1 



a G235A single nucleotide polymorphism (SNP) in exon 3 of CYP3A4 in 6 of the 234 chromosomes screened. 
The SNP results in the non-conservative amino acid exchange Gly-»Asp at position 56 of the CYP3A4 protein, 
a G-»T SNP in intron 7 of CYP3A4 in 4 out of 44 chromosomes screened. The SNP is localized at position 33 of 
the intron sequence (the G of the 5' consensus splice site GT is taken as position 1). 
5 . • a C1229G SNP in exon 11 of CYP3A7 in 17 out of 232 chromosomes screened. The SNP results in the non- 
conservative amino acid exchange Thr-^Arg at position 409 of the CYP3A7 protein. 

Example 4: Polymorphisms in the hPXR gene 

10 [0084] In accordance with the present invention 22 DNA samples were screened for mutations in exons 1b, 2, 5/6, 

7, 8, 9 and 9a of the hPXR gene. Three polymorphisms were detected in the samples screened (Fig. 6, Table 7, Table 

8, Figure 8). The numbering of nucleotide polymorphisms within the hPXR transcript, and of positions of the resulting 
amino acid exchanges is based on GenBank hPXR sequence AF084645. 

A C149A SNP in exon 1b in 27 out of 44 chromosomes screened. Exon 1b is untranslated, i.e. the SNP has no 
effect on the amino acid sequence of the protein. 

a CC975TT SNP in exon 5/6 in 2 out of 44 chromosomes screened. The SNP is silent, i.e. it has no effect on the 
amino acid sequence of the protein. 

A C->T SNP in intron 7 in 8 out of 44 chromosomes screened. The SNP is localized at position 184 of the intron 
sequence (the G of the 5' consensus splice site GT is taken as position 1). 

Example 5: An enzymatic test for the CYP3A7 exon 11 C1229G polymorphism 

[0085] The C1229G polymorphism detected in exon 11 of CYP3A7 results in the loss of an Alwnl restriction site (Fig. 
25 7A). In accordance with the present invention an Alwnl-based test was developed for the genotyping of the C1229G 
allele. An example is shown in Fig. 7B. A digest of the 404 bp genomic fragment amplified from a wild-type sample 
(wt/wt) with primers 3A742F and 3A738R (Table 2) generates two fragments of 316 bp and 88 bp, respectively. In a 
heterozygous sample (wt/C1229G), approximately half of the DNA remains undigested, due to the loss of the restriction 
site in the mutant allele. 

30 [0086] Because population genetics enables a calculation of the expected frequency of homozygous vs. hetero- 
zygous alleles of a defined gene (Hardy Weinberg formula, p e2 + 2pq + q e2 = 1 ), it is also possible to confirm the 
predicted (with that formula) distribution of homozygous vs. heterozygous alleles and deviations with the experimental 
findings. This can serve as internal control and confirmation that a detected sequence deviation indeed represents a 
novel allele. 

35 



Table 1. 



40 r . 



50 



Oligonucleotides used to determine the exon/intron boundaries of the CYP3A4 gene and of the exon 11 of 
CYP3A7 (in bold). 


Name 


Position 


Sequence (5' - 3') 


T7 ' 


BAC CS 


TAATACGACTCACTATAGGG 


3A43F 


exon 2 


GAACCCATTCACATGGAC 


3A46R 


exon 4 


TGATCATGTCAGGATCTG 


3A47F 


exon 4 


GGTCAACAGCCTGTGCTG 


3A48R, 


exon 5 


TCCACTGGTGAAGGTTGG 


3A49F 


exon 5 


GTGCCATCTCTATAGCTG 


3A410R 


exon 6 


CTTCCCGCCTCAGATTTC 


3A411F 


exon 6 


GAAATCTGAGGCGGGAAG 


3A412R 


exon 7 


r GGGTCTTGTGGATTGTTG 


3A413F 


exon 7 


' C AAC AATCC AC AAG ACCC 


3A414R 


exon 8 


GTGTATCTTCGAGGCGAC 


3A415F 


exon 8 


CTTTCCATTCCTCATCCC 
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Table 1. (continued) 



Oligonucleotides used to determine the exon/intron boundaries of the CYP3A4 gene and f the exon 11 of 
CYP3A7 (in bold). 


Name 


Position 


Sequence (5' - 3") 


3A416R 


exon 9 


CCTTTGTGGGACTCAGTTTC 


3A419F 


' exon 10 


GCCACTCACCCTGATGTC 


3A720R 


exon 11 


ATCACCACCCACCCTTTG 


3A721F 


exon 11 


CAAAGGGTGGGTGGTGAT 


3A422R 


exon 12 


GAGAGCAAACCTCATGCC 


3A423F 


exon 12 


GGCATGAGGtTTGCTCTC 


3A424R 


exon 13 


GGTGCCATCCCTTGACTC 


3A426R 


exon 2 


GCAGAGGTGTGGGCCCTG 


3A4436F 


intron 8 


GGAGATCAAGGACCACGCTTGTG 


3A441R 


intron 10 


CTTACGCTTCTGCCAGTAGCAACC 


CYP3A4PF 


promoter 


AACAGGCGTGGAAACACAAT 


CYP3A4PR 


promoter 


CTTTCCTGCCCTGCACAG 



[0087] Fifty ng of genomic DNA was added to a reaction mix (total volume 30 or 50 containing 1x PGR buffer 
(Q=Qiagen, Cat.Nr: 1005479, or B2=Boehringer (currently Roche) Expand Long Template PCR Buffer number 2, Cat. 
Nr. 1742655), 0.25 uMeach oligonucleotide (Metabion), 200 |xM dNTPs, and 1 U of Taq polymerase (Qiagen). Ampli- 
fications were performed on a RoboCycler Gradient 96 (Stratagene) with an initial denaturation step of 2 min. at 94 °C 
followed by 32 amplification cycles of denaturation (40 sec, 94 °C), annealing (45 sec, temperatures 56-60 °C), and 
extension (60-150 sec, 72 °C). This was followed by a final extension step 5,min., 72 °C The sequencing of PCR 
fragments and BAG clones was performed on a GeneAmp PCR System 9700 (Perkin-Elnier) using a dye-terminator 
DNA sequencing kit. (Perkin-Elmer, Cat.Nr. 4303154), according to manufacturers^ instructions. 
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[0088] Fifty ng of genomic DNA was added to a reaction mix containing 1x PGR buffer (Qiagen), 0.5 ^iM oligonucle- 
otides, 200 u.M dNTPs, and 1 U of Taq polymerase (Qiagen). Amplifications were performed on a RoboCycler Gradient 
96 (Stratagene) with an initial denaturation step of 2 min. at 94 °C followed by 32 amplification cycles of denaturation 
(40 sec, 94 °C), annealing (45 sec, temperatures given above), and extension (60 sec, 72 °C). This was followed by 
5 a final extension step 5 min. t 72 °C All sequencing reactions were performed on a GeneAmp PCR System 9700 (Perkin- 
Elmer) using a dye-terminator DNA sequencing kit (Perkin-Elmer, catalog number4303154), according to manufactur- 
ers^ instructions. 



Table 3. 



10 



15 



20 



25 



30 



35 



Oligonucleotides used to determine the structure and exon/intron boundaries of the hPXR gene. 


Name 


Position 


Sequence (5* - 3') 


KPXR1F 


exon4 


TCATGTCCG ACGAGGCCG 


hPXR4F 


exon 5/6 


CCCACATGGCTGACATGT 


hPXRSF 


exon 7 


CCCATCGAGGACCAGATC 


hPXR6R 


exon 7 


GTCTTCCAAGCAGTAGGA 


hPXR7R 


exon 8 


CAGCATGGGCTCCAGTAG 


. hPXRIOR 


exon 9a 


CCTGTGATGCCGAACAAC 


hPXR11F 


exon 9 


CATTGAATGCAATCGGCC 


hPXR12R 


exon 9a 


G CTCTTG G C AGTGTCC AT 


hPXR15F 


exon 2 


GGAAAGCCCAGTGTCAAC 


hPXR16F 


exon 3 


CCATGAAACGCAACGCCC 


hPXR18R 


exon 2 


C CTTG C ATCCTTC AC ATG 


hPXR19R 


exon 3 


CATGCCGCTCTCCAGGCA 


hPXR20R 


exon 4 


CGGCCTCGTCGGACATGA 


hPXR21R 


exon 5/6 


. ACATGTCAGCCATGTGGG 


hPXR47F 


exon 1b 


CAAGCCAAGTGTTCACAGTG 


hPXR48R 


exon 1b 


CACTGTGAACACTTGGCTTG 


hPXR52F 


exon 1a 


CAAGGACAGCAGCATGACAGTCAC 


hPXR54R 


exon 1a 


AGCCAACTCAGCCGCAGC 



40 [0089] Fifty ng of genomic DNA was added to a reaction mix (total volume 30 or 50 containing 1x PCR buffer 
(Q=Qiagen, Cat.Nr. 1005479, or B2=Boehringer (currently Roche) Expand Long Template PCR Buffer number 2, Cat. 
Nr. 1742655), 0.25 each oligonucleotide (Metabion), 200 u M dNTPs, and 1 U of Taq polymerase (Qiagen). Ampli- 
fications were performed on a RoboCycler Gradient 96 (Stratagene) with an initial denaturation step of 2 min. at 94 °C 
followed by 32 amplification cycles of denaturation (40 sec, 94 °C), annealing (45 sec, temperatures 56-60 °C), and 

45 extension (60-150 sec, 72 °C). This was followed by a final extension step 5 min., 72 °C All sequencing reactions 
were performed on a GeneAmp PCR System 9700 (Perkin-Elmer) using a dye-terminator DNA sequencing kit (Perkin- 
Elmer, Cat.Nr. 4303154), according to manufacturers^ instructions 



Table 4. 



55 



Expn-intron organization of the hPXR gene. 


Exon number 


Exon size (b) 


Sequence at exon-intron junction 


Intron size (kb) 






5* splice site 


3* splice site 




1b 


281 


AAGCAG gtatgg.... 




0.403 


1a 


144 


AAACCAgtgagt.... 


....ttctag TCCAAG 


7.0 


2 


219 


TTTCAG gtagag.... 


....tcacag GAGGGC 


-2.7 



19 
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Table 4. (continued) 



Exon-intron organization of the hPXR gene. 


Fynn numhpr 




Sequence at exon-intron junction 


Intron ^i7f» 


3 


134 


AGGAGA gtgagc.... 


.... ctgcag TGATCA 


-1.2 


4 


188 


TTCCGG gtagga.... 


....tcctag CTGCCA 


~ 1 .0 


5/6 


275 


CTTCAG gtagga.... 


....tgccag GGACTT 


~ 1.9 


7 


143 


CTGCAG gtgccc.... 


....ccacag GTGGCT 


0.201 


8 


117 


CCCCAG gtgagg.... 


....ctccag ACCGCC 


0.286 


9 


106 


TCATAG gtgagc... 


....atgcag GTTCTT 


~ 1.3 


9a 


1418 









[0090] The sizes of exons 1 b and 1 a based on a comparison of genomic (Figure 8) ? and cDNA (GenBank accession 
numbers AF061 056.1 , AF084645 AF084644), hPXR sequences. 
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Table 6. 





Distribution of polymorphisms in CYP3A4 (exons 3, 4, 5, 7 and 9) and CYP3A7 (exon 11, in bold) in a set of 22 


5 


DNA samples. 








Number 


Donor 


Exons 










Number 














1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


10 


1 


11 






























2 


17 






























3 


18 






















X 








4 


19 




























15 


1 5 


20 






























6 


21 






























7 


22 




























20 


8 


23 






























9 


24 














X 








X 








10 


25 






















X 








11 


27 




























25 


12 


29 






























13 


31 






























14 


33 




























30 


15 


34 






























16 


35 






















X 








17 


36 




























35 


18 


37 




























19 


38 














X 
















20 


40 






X 
























21 


41 














X 








X 






40 


22 


43 














X 








X 








M -"=wild-type, n X n 


=heterozygbte, "XX* -homozygote 







Table 7. 



55 



Distribution of hPXR gene polymorphisms in a set of 22 DNA samples. 


Number 


Donor 


Exons 




Number 


1b 


1a 


2 


3 


4 


5/6 


7 


8 


9 


9a 


1 


11 














X 








2 


17 


X 




















3 


18 


XX 




















4 


19 


X 




















5 


20 


X 




















6 


21 


XX 












X 









22 
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Table 7. (continued) 



Distribution of hPXR gene polymorphisms in a set of 22 DNA samples. 



Number 


Donor 


Exons 




Number 


ID 


i a 


o 

£. 


o 
o 


A 


O/D 


7 


o 
o 


Q 

y 


Qa 
pa 


7 


22 


WW 

AA 




















8 


23 


YY 
AA 




















9 


24 


V V 
A A 




















10 


25 


Y 
A 




















11 


27 












AA 


V V 
AA 








12 


29 


V V 
AA 












v 
A 








13 


31 


v 
A 












v 
A 








14 


33 






















15 


34 


X 




















16 


35 


XX, 




















17, 


36 


X 




















18 


37 


X 




















19 


38 














X 








20 


40 


XX 












X 








21 


41 


X 




















22 


43 


XX 





















"-^wild-type, "X^heterozygote, "XX"=homozygote 
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Table 8. Genetic variants of CYP3A4, CYP3A7 and hPXR. 



Gene Position wt sequence (5'-3') 

CYP3A4 exon 3 F: TCCCAGGGCTTTTGT 

R: ACAAAAGCCCTGGGA 



mut sequence (5'-3') 

F: TCCCAGGACTTTTGT 
R: ACAAAAGTCCTGGGA 



intron 7 f : tatcttictctctt 

R: AAGAGAGAAAGATA 



F : . TATCTTSCTCTCTT 
R: AAGAGAGCAAGATA 



CYP3A7 exon 11 f : tactggacagagc 

R: GCTCTGTCCAGTA 



F: TACTGGAGAGAGC 
R: GCTCT.CTCCAGTA . 



hPXR exon 1b F: cctgaacaaggcag 

R: CTGCCTTGTTCAGG 



F: CCTGAA&AAGGCAG 
R: CTGCCTTTTTCAGG 



exon 5/6 F : CAGTGGCGGGAAA 
R: TTTCCCGCCACTG 



F: CAGTGGTGGGAAA 
R: TTTCCCACCACTG 



intron 7 f : cctccatcctgttac 

R: GTAACAGfiATGGAGG 



F: CCTCCATJCTGTTAC 
R: GTAACAG&ATGGAGG 
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SEQUENCE LISTING 

<110> EPIDAUROS Biotechnologie Aktiengesellschaf t 

<120> Polymorphisms in the human CYP3A4, CYP3A7 and hPXR 
genes and their use in diagnostic and therapeutic 
applications 

<130> D2145EP 

<140> 
<14.1> 

<160> 135 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 1 

taatacgact cactataggg 



<210> 2 
<211> 18 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 2 

gaacccattc acatggac 



<210> 3 
<211> 18 
<212> DNA . 

<213> Artificial Sequence 
<220> ' 

<223> Description of Artificial Sequence: DNA 
<400> 3 

tgatcatgtc aggatctg 



<210> 4 
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<211> 18 
<212> DNA 

<213> Artificial Sequence 



5 



<220> 

<223> Description of Artificial Sequence: DNA 



10 



<400> 4 

ggtcaacagc ctgtgctg 



18 



15 



<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 4 



<220> 

<223> Description of Artificial Sequence: DNA 



20 



<400> 5 

tccactggtg aaggttgg 



18 



25 



<210> 6 
<211> 18 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 6 

gtgccatctc tatagctg 18 



<210> 7 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<22G> 

<223> Description of Artificial Sequence: DNA 



<400> 7 

cttcccgcct cagatttc 



18 



45 



55 
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10 



15 



25 



30 



35 



40 



45 



<400> 8 

gaaatctgag gcgggaag 18 



<210> 9 
<211> 18 
<212> DNA 

<213> .Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 9 

gggtcttgtg gattgttg 18 



<210> 10 
<211> 18 
20 ■ <212> DNA 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 10 

caacaatcca caagaccc 18 



<210> 11 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 11 

gtgtatcttc gaggcgac 18 



<210> 12 
. <211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 12 1 
50 ctttccattc ctcatccc 



<210> 13 



55 



28 
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<211> 20 

<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> Description of Artificial Sequence: DNA 

<400> 13 

10 cctttgtggg actcagtttc 20 



15 



20 



30 



35 



50 



<210> 14 

<211> 18 ' 

<212> DNA ' 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 
<400> 14 

gccactcacc ctgatgtc 18 



<210> 15. 
25 <211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 15 . 

atcaccaccc accctttg 18 

<210> 16 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

40 . <220> 

<223> Description of Artificial Sequence: DNA 

<400> 16 

caaagggtgg gtggtgat 18 

45 

<210> 17 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



55 



29 



EP 1 088 900 A1 



<400> 17 

gagagcaaac ctcatgcc 



<210> 18 
<211> 18 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 



18 



15 



25 



35 



45 



50 



55 



<400> 18 

ggcatgaggt ttgctctc 18 



<210> 19 
<211> 18 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 



<400> 19 

ggtgccatcc cttgactc 18 



30 <210> 20 

<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA 
<400> 20 

gcagaggtgt gggccctg 18 



<210> 21 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 21 

ggagatcaag gaccacgctt gtg 23 



<210> 22 



30 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 22 

cttacgcttc tgccagtagc aacc ,24 

<210> 23 
<211> 20 

<212> DNA ; 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 
20 <400> 23 

aacaggcgtg gaaacacaat 20 

<210> 24 
25 <211> 18 

<212> DNA 

<213> Artificial Sequence 



10 



15 



30 



35 



<220> 

<223> Description of Artificial Sequence: DNA 
<400> 24 , 

ctttcctgcc ctgcacag 18 

<210> 2S • 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> Description of Artificial Sequence: DNA 

<400> 25 f ^ 

cctctaactg ccagcaagtc tg 22 

45 

<210> 26 
<21i> 21 
> <212> DNA 
50 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 

55 



31 



EP 1 088 900 A1 



<400> 26 

gcgctgagac tgtcctctgt g 21 



<210> 27 
<211> 24 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



15 



<400> 27 

agtctggctt cctgggttgg gctc 



24 



<210> 28 
<211> 26 
20 <212> DNA 

<213> Artificial Sequence 



25 



<220> 

<223> Description of Artificial Sequence: DNA 
<400> 28 

gaagtggacg tggaaccttc ctggac 



26 



30 



<210> 29 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



35 



<220> 

<223> Description of Artificial Sequence: DNA 



40 



45 



<400> 29" 

ctacaaccat ggagacctcc 



<210> 30 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence: DNA 



20 



50 



<400> 30 

tacctgtccc caccagattc 



20 



<210> 31 



55 



32 
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<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

■<223> Description of Artificial Sequence: 
<400> 31 

gtctgtcttg aetggacatg tgg 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence 
<400> 32 

gatgatggtc acacatatct tc 



<210> 33 
<211> 23 

<212> DNA . : ' . 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence 
<400> 33 

ggagatcaag gaccacgctt gtg 

<210> 34 • 
<21i> '2 5 
<212> DNA 

<213> Artificial Sequence, 
<220> , ' 

<223> Description of Artificial Sequence 
<400> 34 ' ; 

ctcatcatcc tggaatactt. cctgc 

<210> 35 " • 

<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 



33 
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<400> 35 

ccagtatgag ttgttctctg, g 

<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
<400> 36 

aggcagaata tgcttgaacc aggc 

<210> 37 
<211> 18 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence 
<400> 37 

tcatgtccga cgaggccg - 

<210> 38 v 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 

<400> 38 

cccacatggc tgacatgt 

<210> 39 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 

<400> 39 

cccatcgagg accagatc 

<210> 40 



34 
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<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220>, 

<223> Description of Artificial Sequence: DNA 
<400> 40. 

gtcttccaag cagtagga 18 



15 



<210> 41 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA * 
20 <400> 41 

cagcatgggc tccagtag 18 



<210> 42 

25 <211> 18 ' 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 
<400> 42 

cctgtgatgc cgaacaac 18 

<210> 43 
<211> 18' 
<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> Description of Artificial Sequence: DNA 

<400> 43 

cattgaatgc aatcggcc 18 

45 



30 



35 



<210> 4 4 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



35 
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10 



15 



20 



25 



30 



35 



40 



45 



<400> 44 

gctcttggca gtgtccat . 18 



<210> 45 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 45 

ggaaagccca gtgtcaac 18 



<210> 46 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DMA 
<400> 46 

ccatgaaacg caacgccc 



<210> 47 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 47 

ccttgcatcc ttcacatg 18 



<210> 48 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 48 
so catgccgctc tccaggca 



<210> 49 



55 
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jo 



15 



<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 49 

-cggcctcgtc ggacatga 



<210> 50 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA 
20 <400> 50 

acatgtcagc catgtggg 18 



<210> 51 
<211> 20 
. <212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 
<400> 51 

caagccaagt gttcacagtg 20 

<210> 52 
<211> 20 
<212> DNA , 

<213> Artificial Sequence 

40- <220> 

<223> Description of Artificial Sequence: DNA 

<400> 52 

cactgtgaac acttggcttg 2 0 



25 



30 



35 



45 



<210> 53 . . 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



55 
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<400> 53 

caaggacagc agcatgacag tcac 



<210> 54 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<400> 54 

agccaactca gccgcagc 

<210> 55 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 

<400> 55 
aagcaggtat gg 

<210> 56 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 

<400> 56 
aaaccagtga gt 

<210> 57 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 

<400> 57 
ttctagtcca ag 

<210> 58 



38 
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<211> 12 

<212> DNA . ' , ; 

<213> Artificial Sequence , 

<220> 

<223> Description of Artificial Sequence: DNA* 
<400> 58 

tttcaggtag ag 12 

<210> 59 ' * 

<211> 12 • 
.' <212> DNA ' • ' • * 

<213> Artificial' Sequence 

<220> ' > • 

<223> Description of Artificial Sequence: DNA 

<400> 59 , 

tcacaggagg gc , 12 

<210> 60 ' " • * 

<211> 12 '• ' . 

<212> DNA ' , 

<213> Artificial Sequence , 

<220> ■ , " . _ . . . • * . , ' ■ ' " 

<223> Description of Artificial Sequence: DNA 

<400> 60 ' . v 

aggagagtga gc 12 

<210> 61 ' 1 ' 

<211> 12 
<212> DNA 

<213> Artificial Sequence 

<220> ' , , 

<223> Description of Artificial Sequence: DNA 

<400> 61 

ctgcagtgat ca 12 

<210> 62 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



39 
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15 



20 



25 



<400> 62 

ttccgggtag ga 12 

5 . . ' ■ ' 

<210> 63 
<211> 12 
<212> DNA 
w <213> Artificial Sequence 

<220> 

<223> ; Description of Artificial Sequence: DNA 
<400> 63 

tcctagctgc ca 12 

<210> 64 
<211> 12" 
<212> DNA 

<213> Artificial Sequence 

<220> • '^rtiA 

<223> Description of Artificial Sequence: DNA 

<400> 64 

cttcaggtag ga 12 

30 ■ <210> 65 

<211> 12 
<212> .DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 65 

tgccagggac tt 12 

<210> 66 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 66 

50 ctgcaggtgc cc 12 

<210> 67 • 

55 



35 



40 



45 



40 
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<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 67 

ccacaggtgg ct 12 

<210> 68 

<211> 12 

<212> DNA , 

<213> Artificial Sequence * ^ 

<220> 

<223> Description of Artificial Sequence: DNA 

20 <400> 68 

ccccaggtga gg 12 



10 



15 



30' 



35 



<210> 69 
25 <211> 12 

<212> DNA 

<213> Artificial Sequence 
■ . <220> 

<223> Description of Artificial Sequence: DNA 
<400> 69 

ctccagaccg cc 12 

. <210> 70 
<211> 12 
<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> Description of Artificial Sequence: DNA 

<400> 70 

tcataggtga gc 12 

45 

<210> 71 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



50 



55 



<220> 

<223> Description of Artificial Sequence: DNA 



41 
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<400> 71 

atgcaggttc tt 12 

<210> 72 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 72 

tcaagtgctg gacttgggac 20 



73 
20 
DNA 

Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 73 

cccactatga tgctgacctc 20 



20 



<210> 
<211> 
<212> 
<213> 



30 



35 



40 



45 



50 



<210> 74 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 74 

ctgaggcctc tacacatc , 18 



<210> 75 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description ■ of Artificial Sequence: DNA 
<400> 75 

aggccctgag atgttacc 18 



<210> 76 



55 
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<211> 18 ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 76 

ctgagttggg acctgtct 18 

<210> 77 
<211> 18 

<212> DNA . 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 

20 <400> 77 

ccaggccctt tgaacctc 18 

<210> 78 
25 <211> 18 

<212> DNA 

<213> Artificial Sequence 



10 



15 



30 



35 



<220> 

<223> Description of Artificial Sequence: DNA 
<400> 78 

ctgctggtgc cggcctgt 18 

<210> 79 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

™ <220> ' 

<223> Description of Artificial Sequence: DNA 

<400> 79 

gactgggacc ttccctgg 18 

45 

<210> 80 
<211> 18 
<212> DNA 

50 <213> Artificial Sequence * 

<220> 

<223> Description of Artificial Sequence: DNA 

55 . 



43 
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<400> 80 

gagcaatgcc ctgactct 18 

<210> 81 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 81 

ccctctggcc atgaagtc ' 18 

<210> 82 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 82 

tgcttgtgca gcctcaga 18 

<210> 83 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 83 

gctcttggca gtgtccat 18 

<210> 84 

<211> 15 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 

<400> 84 

50 tcccagggct tttgt 15 

<210> 85 

55 ' 



10 



15 



20 



25 



30 



35 



40 



45 
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<211> 15 
<212> DNA 

<213> Artificial Sequence 



5 



<220> 

<223> Description of Artificial Sequence: DNA 



w 



<400> 85 

acaaaagccc tggga 



15 



15 



<210> 86 
<211> 15 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA 



20 



<400> 86 

tcccaggact tttgt 



15 



<210> 87 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 87 

acaaaagtcc tggga 15 

<210> 88. 
<211> 14, 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



<210> 89 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



<400> 88 
tatctttctc tctt 



45 



55 
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10 



25 



30 



35 



40 



45 



50 



55 



<400> 89 

aagagagaaa gat 13 



<210> 90 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 90 

tatcttgctc tctt 14 



<210> 91 
<211> 14 
20 <212> DNA 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 91 

aagagagcaa gata 14 



<210> 92 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 92 

tactggacag age 13 



<210> 93 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 93 

gctctgtcca gta 13 



<210> 94 



46 
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<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 
<400> 94 

tactggagag age 13 

<210> 95 

<211> 13 

<212> DNA ' 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA 

20 <400> 95 

gctctctcca gta 13 

<210> 96 
25 <211> 14 

<212> DNA 

<213> Artificial Sequence 



10 



15 



30 



35 



<220> 

<223> Description of Artificial Sequence: DNA 
<400> 96 

cctgaacaag gcag 14 

<210> 97 
<211> 14 
<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> Description of Artificial Sequence: DNA 

<400> 97 

ctgccttgtt cagg l 4 



45 



50 



<210> 98 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA 



55 
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<400> 98 
cctgaaaaag gcag 

<210> 99 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

<400> 99 
ctgccttttt cagg 

<210> 100 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

<400> 100 
cagtggcggg aaa 

<210> 101 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

<400> 101 
tttcccgcca ctg 



<210> 102 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

<400> 102 
cagtggtggg aaa 

<210> 103 



48 
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<211> 13 
<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> Description of Artificial Sequence: DNA 
<400> 103 

w tttcccacca ctg 13 

<210> 104 
<211> 15 
<212> DNA 

<213> Artificial Sequence 



15 



<220> 

<223> Description of Artificial Sequence: DNA 

20 <400> 104 

cctccatcct gttac 15 

,<210> 105 ' 
25 " <211> 15 

<212> DNA 
. <213>. Artificial Sequence 



30 



35 



<220> 

<22.3> Description of Artificial Sequence: DNA 
<400> 105 

gtaacaggat ggagg 15 

<210> 106 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> Description of Artificial Sequence: DNA 

<400>. 106 

cctccattct gttac f 15 

45 

<210> 107 
<211> 15 

<212> DNA ■ ' 

50 <213> Artificial Sequence 

<220> 

<223> Description .of Artificial . Sequence : DNA 



55 
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10 



15 



30 



35 



40 



<400> 107 

gtaacagaat ggagg 15 



<210> 108 
<211> 249 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (11.5) 

<220> 

<221> exoh 

<222> (116) . . (168) 



<220> 

20 <221> intron 

<222> (169) . . (249) 

<220> 

<221> CDS 

25 <222> (116).. (166) 



<40.0> 108 

cctctaactg ccagcaagtc tgatttcatt ggcttcgact gttttcatcc caattagagg 60 

cagggttaag tacattaaaa ataataatca aatattattt tgtttctcct cccag grc 118 

Xaa 
1 

ttt tgt atg ttt gac atg gaa -tgt cat aaa aag tat gga aaa gtg'tgg 166 
Phe Cys Met Phe Asp Met Glu Cys His Lys Lys Tyr Gly Lys Val Trp 
5 10 15 

gggtgagtat tctggaaact tccattggat agacttgttt ctatgatgag tttaccccac 226 

tgcacagagg acagtctcag ccc 249 



<210> 109 

<211> 17 

<212> PRT 

45 <213> Homo sapiens 



50 



<400> 109 

Xaa' Phe Cys Met Phe Asp Met Glu Cys His Lys Lys Tyr Gly Lys Val 
1 5 10 15 

Trp 



55 
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<210> 110 
<211> 293 
<212> DNA 
<213> Homo sapiens 

<220> . • 

<221> intron 
<222> (1) . . (77) 

<220> . 

<221> exon 

<222> (78) . . (177) 

<220> 

<221> intron 
<222> (178) . . (293) 

<220> 

<221> CDS 

<222> (79) . . (177) 

<4 00> 110 ' 

agtctggctt cctgggttgg gctccagctg. tagaataagg ctgttgatgt ttaatcaact 60 

ctgttttttt cacacagc ttt tat gat ggt caa cag cct gtg ctg get ate 111 

Phe Tyr Asp Gly Gin Gin Pro Val Leu Ala lie 
1 5 10 

aca gat cct gac atg ate aaa aca gtg eta gtg aaa gaa tgt tat tct 159 
Thr Asp Pro Asp Met lie Lys Thr Val Leu Val Lys Glu Cys Tyr Ser 
15 20 25 

gte ttc aca aac egg agg gtaagcattc atgtgttgaa attaaaatac 207 
Val Phe Thr Asn Arg Arg 
30 

tgattgatta aatttatatt ttgaaattct tatatattca tagacagttg cctaaaaaat 267 
gtccaggaag gttccacgtc cacttc 293 



<210> 111 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 111 

Phe Tyr Asp Gly Gin Gin Pro Val Leu Ala lie Thr Asp Pro Asp Met 
1 5 10 15 

lie Lys Thr Val Leu Val Lys Glu Cys Tyr Ser Val Phe Thr Asn Arg 
20 25 ' 30 
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Arg 



<210> 112 
<211> 236 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (61) 

<220> 

<221> exon 

<222> (62) . . (175) 



20 <220> \ 

<221> intron 
<222> (176) . . (236) 

<220> 
25 <22l> CDS 

<222> (62).. (175) 



10 



15 



30 



35 



<400> 112 • 

ctacaaccat ggagacctcc acaactgatg taggacaaaa ' tgtttctgct ttgaactcta 60 

g cct. ttt ggt cca gtg-gga ttt atg aaa agt gcc ate tct ata get gag 109 
Pro Phe Gly -Pro Val Gly Phe Met Lys Ser Ala He Ser He Ala Glu 
1 5 10 15 

gat gaa gaa tgg aag aga tta cga tea ttg ctg tct cca acc ttc acc 157 
Asp Glu' Glu Trp Lys Arg Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr 
20 25 30 



agt. gga aaa etc aag gag gtatgaaaat aacatgagtt ttaataagaa 205 
Ser Gly Lys Leu Lys Glu 
40 ■ ■ 35 

acttaaagaa tgaatctggt ggggacaggt a * 236 

45 <210> 113 

<211> 38 
<212> PRT 

<213> Homo sapiens 

50 <400> 113 

Pro Phe Gly Pro Val Gly Phe Met Lys Ser Ala lie Ser He Ala Glu 
1 5 10 15 



55 
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Asp Glu Glu Trp Lys Arg Leu Arg Ser Leu Leu Ser Pro The Phe Thr 
20 . 25 30 

Ser Gly Lys Leu Lys Glu 
35 



10 



15 



30 



35 



<210> 114 
<211> 393 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (98) 



<220> 

<221> exon 
20 <222> (99) . . (247) 

<220> 

<221> intron 
<222> (248) . . (393) 

25 

<220> 

<221> CDS 

<222> (100) . . (246) 

<400> 114 

gtctgtcttg actggacatg tggctttcct gatgeacgea tagaggaagg atggtaaaaa 60 

ggtgctgatt ttaattttcc acatctttct ccactcagc gtc ttt ggg gec tac 114 

Val Phe Gly- Ala Tyr 
1 5 

age atg gat gtg ate act age aca tea ttt gga gtg aac ate gac tct 162 
Ser Met Asp Val He Thr Ser Thr Ser Phe Gly Val Asn He Asp Ser 
10 15 20 

id etc aac aat cca caa gac ccc ttt gtg gaa aac acc aag aag ctt tta 210 

Leu Asn Asn Pro Gin Asp Pro Phe Val Glu Asn Thr Lys- Lys Leu Leu 
25 30 35 

aga ttt gat ttt ttg gat cca ttc ttt etc tea ata agtatgtgga , 256 

45 Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser lie 

4 0 45 

ctactatttc cttttattta tcttyctctc ttaaaaataa ctgetttatt gagatanaaa 316 

50 tcaccatgta attcatccac ttaaaatata cagttcagtg atttgtagta catttgaaga 376 

tatgtgtgac catcatc 393 



55 
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<210> 115 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<400> 115 

Val Phe Gly Ala Tyr Ser Met Asp Val He Thr Ser Thr Ser Phe Gly 
1 5 10 15 

Val Asn He Asp Ser Leu Asn Asn Pro Gin Asp Pro Phe Val Glu Asn 
20 25 30 

Thr Lys Lys Leu Leu Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser 
35 40 45 

He 



20 



<210> 116 

<211> 240 

<212> DNA 

25 <213> Homo sapiens 



<220> 

<221> intron • 
<222> (1) . . (82) 

<220> 

<221> exon 

<222> (83) . . (149) 

<220> 

<221> intron 
<222> (150) . . (240) 



30 



35 



<221> CDS 
M <222> (83) . . (148) 



50 



55 



<400> 116 

ggagatcaag gaccacgctt gtgatttact tctgacttca ggagccactt tctgtcagtg 60 

aaatttctct ttttgcttct ag cac cga gtg gat tec ctt cag ctg atg att 112 

His Arg Val Asp Phe Leu Gin Leu Met He 
1 5 10 

gac tct cag aat tea aaa gaa act gag tec cac aaa ggtaaccaga 158 
Asp Ser Gin Asn Ser Lys Glu Thr Glu Ser His Lys 
15 20 

gtgtttctga gggctacttg tggggcactc agagggaagg ccttgttctg aaaatgtgca 218 



54 
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10 



15 



ggaagtattc caggatgatg ag 240 



<210> 117 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 117 

His Arg Val Asp- Phe Leu Gin Leu Met lie Asp Ser Gin Asn Ser Lys 
1 . 5 10 15 

Glu Thr Glu Ser His Lys 
20 



<210> 118 
20 <211>399 

<212> DNA 

<213> Homo sapiens 

<220> 

25 <221> intron 

<222> (1) . . (Ill) 

<220> 

<221> exon 

<222> (112) . . (338) 



30 



35 



40 



45 



50 



<220> 

<221> intron 
<222> (339) . . (399) 

<220> 

<221> CDS. 

<222> (112) . . (336) 

<400> 118 

ccagtatgag ttgttctctg gaacttctaa cagttcaaca gtactacatg gactgagtta 60 

aaagttaatt caaaaatctc aatttatcca aatctgtttc tttcttttca g>gca cca 117 

Ala Pro 
1 

ccc acc tat gat act gtg eta cag ttg gag tat ctt gac atg gtg gtg 165 
Pro Thr Tyr Asp Thr Val Leu Gin Leu Glu Tyr Leu Asp Met Val Val 
5 10 15 

aat gaa aca etc aga tta ttc cca gtt get atg aga ctt gag agg gtc 213 
Asn Glu Thr Leu Arg Leu Phe Pro Val Ala Met Arg Leu Glu Arg Val 
20 25 30 



55 
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tgc aaa aaa gat" gtt gaa ate aat ggg atg ttt att ccc aaa ggg gtg 
Cys Lys Lys Asp Val Glu lie Asn Gly Met Phe lie Pro Lys Gly' Val 
35 40 45 50 



gtg gtg atg att cca age tat gtt ctt cat cat gac cca aag tac tgg 
Val Val Met lie Pro Ser Tyr Val Leu His His Asp Pro Lys Tyr Trp 
• 55 60 65 



asa gag cct gag aag ttc etc cct gaa aggtaggagg cccctgggaa 
Xaa Glu Pro Glu Lys Phe Leu Pro Glu 
70 75 



gggagccctc cctgaaccag cctggttcaa gcatattctg cct 



<210> 119 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 119 

Ala Pro Pro Thr Tyr Asp Thr Val Leu Gin Leu Glu Tyr Leu Asp Met 
1 5 10 15 . 

Val Val Asn Glu Thr Leu Arg Leu Phe Pro Val Ala Met Arg Leu Glu 
20 . 25 30 

Arg Val Cys Lys Lys Asp Val Glu lie Asn Gly Met Phe lie Pro Lys 
35 40 45 

Gly Val Val Val Met He Pro Ser Tyr Val Leu His His Asp Pro Lys 
50 55 60 

Tyr Trp Xaa Glu Pro Glu Lys Phe Leu Pro Glu 
65 70 , 75 



<210> 120 
<211> 962 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (66) 



<220> 

<221> intron 
<222> (348) . . (750) 

<220> 

<221> exon 

<222> (751) . . (894) 
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<220> 

<221> intron 
<222> (895) . . (962) 

<220> 
<221> CDS 

<222> (800) . . (892) 
<400> 120 

tcaagtgctg gacttgggac ttaggagggg caatggagcc gcttagtgcc tacatctgac 60 

ttggactgaa atataggtga gagacaagat tgtctcatat ccggggaaat cataacctat 120 

gactaggacg ggaagaggaa geactgcctt tacttcagtg ggaatctcgg cctcagcctg 180 

caagccaagt gttcacagtg aaaaaagcaa gagaataagc taatactcct gtcctgaaca 240 

aggcagcggc tccttggtaa agctactcct tgatcgatcc tttgcaccgg attgttcaaa 300 

gtggacccca ggggagaagt cggagcaaag aacttaccac caagcaggta tggtttttct 360 

ttctttctct tttgctgggg gctgaccgcc cttcagctcc agccaaaaga tgtgtgtgaa 420 

cacaaatata ccttctgttt gaggtcagca tcatagtggg tcgtgaatca tgttggcctt 480 

gctgctgtct cctcatttct agggtgaaaa aaaaaaagca tgaaaacaat cacttaatgt 540 

tgagccccat tactgatgct ctctggtcct gcactagcct cctagaaaaa tcaccacagc 600 

cttaactact gcatgagtta ccacaagtca cacatacaac cagctccctg ttacagggct 660 

ggagtccctg gacceaggaa ataccacctc caaggactgt gggagctggg gactatggga 720 

actgggatca actcagtcct gattcctttt ggcctgctgg gttagtgctg gcagcccccc 780 

tgaggccaag gacagcagc atg aca gtc acc agg act cac cac ttc aag gag 832 

Met Thr Val Thr Arg Thr His His Phe Lys Glu 
1 5 10 

ggg tec etc aga gca cct gec ata ccc ctg cac agt get gcg get gag 880 
Gly Ser Leu Arg Ala Pro Ala lie Pro Leu His Ser Ala Ala Ala Glu 
15 20 ' 25 

ttg get tea aac cagtgagttt tctacctcta ctattgaaag ggcaccttgt 932 
Leu Ala Ser Asn 
30 

cccacagaac egagtcttge ctgcatgtgg 962 



<210> 121 
<211> 31 
<212> PRT 
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<213> Homo sapiens 
<400> 121 

5 Met Thr Val Thr Arg Thr. His His. Phe Lys Glu Gly Ser Leu Arg Ala 

1 5 10 15 

Pro Ala lie Pro Leu His Ser Ala Ala Ala Glu Leu Ala Ser Asn 
20 ' 25 30 

10 



15 



20 



25 



30 



35 



40 



50 
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<210> 


122 


<211> 


345 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


intron 


<222> 


(1).. . (60) 


<220> 




<221> 


exon 


<222> 


(61) (279). 


<220> 




<221> 


intron 


<222> 


(280) . . (345) 


<220> 




<221> 


CDS 


<222> 


(62)-. . (277) 


<400> 


122 



ctgaggcctc tacacatccc tgtccagtct tttcattctc tgtggttttc tcatttctag 60 

t cca aga ggc cca gaa gca aac ctg gag gtg aga ccc aaa gaa age tgg 109 
Pro Arg Gly Pro Glu Ala Asn Leu Glu Val Arg Pro Lys Glu Ser Trp 
1 5 : 10 15 

aac cat get gac ttt gta cac tgt gag gac aca gag tct gtt cct gga 157 
Asn His Ala Asp Phe Val His Cys Glu Asp Thr Glu Ser Val Pro Gly 
.20 25 30 



aag ccc agt gtc aac gca gat gag gaa gtc gga ggt ccc caa ate tgc 205 

Lys Pro Ser Val Asn Ala Asp Glu Glu Val Gly Gly Pro Gin lie Cys 

45 35 40 45 

cgt gta tgt 1 ggg gac aag gec act ggc tat cac ttc aat gtc atg aca 253 

Arg Val Cys Gly Asp Lys Ala Thr Gly Tyr His Phe Asn Val Met Thr 

50 55 60 



tgt gaa gga tgc aag ggc ttt ttc aggtagagtt acccatcagc cttcacccac 307 
Cys Glu Gly Cys Lys Gly Phe Phe 
65 ' 70 
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gtgccaccac tgacccactg ggtaacatct cagggcct 



<210> 123 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<400> 123 

Pro Arg Gly Pro Glu Ala Asn Leu Glu Val Arg Pro Lys Glu Ser Trp 
1 5 10 15 

Asn His Ala Asp Phe Val His Cys Glu Asp Thr Glu Ser Val Pro Gly 
20 25 30 

Lys Pro Ser Val Asn Ala Asp Glu Glu Val Gly Gly Pro Gin lie Cys 
35 40 45 

Arg Val Cys Gly Asp Lys Ala Thr Gly Tyr His Phe Asn Val Met Thr 
50 55. 60 

Cys Glu Gly Cys Lys Gly Phe Phe 
65 70 



<210> 124 
<211> 344 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1)..(78) 

<220> 

<221> exon 

<222> (79) . . (212) 

<220> 

<221> intron 
<222> (213) . . (344) 

<220> 

<221> CDS 

<222> (79) . . (210) 

<400> 124 

aggtggtatg gcccggagcc ccaggccgag ggcccgggca cccgtgcatc cccccttctg 

ctccccattc tctcacag gag ggc cat gaa acg caa cgc ccg get gag gtg 

Glu Gly His Glu Thr Gin Arg Pro Ala Glu Val 
15 10 
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10 



15 



30 



35 



40 



50 



ccc ctt ccg gaa ggg cgc ctg cga gat cac ccg gaa gac ccg gcg aca 159 
Pro Leu Pro Glu Gly Arg Leu Arg Asp His Pro Glu Asp Pro Ala Thr 
15 20 25 

gtg cca ggc ctg ccg cct gcg caa gtg cct gga gag egg cat gaa gaa 207 
Val Pro Gly Leu Pro Pro Ala Gin Val Pro Gly Glu Arg His Glu Glu 
30 35 4 0 

gga gagtgagcag tgggcgcgcg ggegggcegg cgccggggtg cacggctctg 260 
Gly 

agtaaggacg. tgccgtgggt gtgtgcatgc ttgtgtggag atgcgcgccg agtgtgcgcg 320 

tgaacacacg tgcacatgtg agct 344 



<210> 125 
<211> 44 
20 <212> PRT , 

<213> Homo sapiens 

<400> 125 

Glu Gly His Glu Thr Gin Arg Pro Ala Glu Val Pro Leu Pro Glu Gly 

25 1 5 : * 10 15 

Arg Leu Arg Asp His Pro Glu Asp Pro Ala Thr Val* Pro Gly Leu Pro 
20 25 30 



Pro Ala Gin. Val Pro Gly Glu Arg His Glu Glu Gly 
3 5 4 0 



<210> 126 

<211> 367 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> intron 
<222> (1) . . (103) 



<220> 

<221> exon 

45 <222> (104) . . (291) 

<220> 

<221> intron 

<222> (292) . . (367) 



<220> 
<221> CDS 

<222> (106) ... (291) 
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<400> 126 

ctaacggctt ctgctgcctt gagagggtta cacagtggct ctccaggggg ctggaggctc 60 . 

accaggggca cgtgtgcctg agccagcctc actgtccctg cagtg ate atg tec gac 117 

lie Met Ser Asp 
1 

gag gec gtg gag gag agg egg gec ttg ate aag egg aag aaa agt gaa 165 
Glu Ala Val Glu Glu Arg Arg Ala Leu lie Lys Arg Lys Lys Ser Glu 
5 10 IS 20 

egg aca ggg act cag cca ctg gga gtg cag ggg ctg aca gag gag cag 213 
Arg Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr Glu Glu Gin 
25 • 30 35 

egg atg atg ate agg gag ctg atg gac get cag atg aaa acc ttt gac 261 
Arg Met Met lie Arg Glu Leu Met Asp Ala Gin Met Lys Thr Phe Asp 
40 . 45 50 v 

act acc ttc tec cat ttc aag aat ttc egg gtaggaggaa ctgcacagtg 311. 
Thr Thr Phe Ser His Phe Lys Asn Phe Arg 
55 60 

25 acccgaggtg tcactgccat cttcattctc acatagaaac tgaggttccc caagga 367 

<210> 127 

<211> 62 

<212> PRT 

<213> Homo sapiens 



10 



15 



20 



30 



35 



<400> 127 

lie Met Ser Asp Giu Ala Val .' Glu Glu Arg Arg Ala Leu' lie Lys Arg 

1 5 10 15 

Lys Lys Ser Glu Arg Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu 

20 25 30 

Thr Glu Glu Gin Arg Met Met He Arg Glu Leu Met Asp Ala Gin Met 

40 35 40. 45 

Lys Thr Phe Asp Thr Thr Phe Ser His Phe Lys Asn Phe Arg 

50 55 ~ 60 



45 



50 



55 



<210> 128 
<211> 415 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
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<222> (1) . . (69) 
<220> 

<221> exon 

<222> (70) . . (344) 

<220> 

<221> intron 
<222> (345) . . (415) 

<220> 

<221> CDS 

<222> (70) . . (342) 

<400> 128 

ctgagttggg acctgtctat gaaagcacat gctgtctctc ctctgtccac ctcctggcat 60 

gtgtcctag ctg cca ggg gtg ctt age agt ggc tgc gag ttg cca gag tct 111 
Leu Pro Gly Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Ser 
1 5 10 

ctg cag gec cca teg agg gaa gaa get gee aag tgg age cag gtc egg 159 
Leu Gin Ala Pro Ser Arg Glu Glu Ala Ala Lys Trp Ser Gin Val Arg 
15 20, 25 30 

aaa gat ctg tgc tct ttg aag gtc tct "ctg cag ctg egg ggg gag gat 207 
Lys Asp Leu Cys Ser Leu Lys Val Ser Leu Gin Leu Arg Gly Glu Asp 
35 40 45 

ggc agt gtc tgg aac tac aaa ccc cca gec gac agt ggy ggg aaa gag 255 
Gly Ser Val Trp Asn Tyr Lys Pro Pro Ala Asp Ser Xaa Gly Lys Glu 
50 55 60 

ate ttc tec ctg ctg ccc cac atg get gac atg tea acc tac atg ttc 303 
lie Phe Ser Leu Leu Pro His Met Ala Asp Met Ser Thr Tyr Met Phe 
65. 70 75 

aaa ggc ate ate age ttt gee aaa gtc ate tec tac ttc aggtaggaca 352 
Lys Gly lie lie Ser Phe Ala Lys Val He Ser Tyr Phe 
80 85 90 

tggagactgg gtggttgggt gtggaaaaga actggaagtg gecaggaggt teaaagggee 412 

tgg 415 



<210> 129 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<400> 129 

Leu Pro Gly Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Ser Leu Gin 
1 " 5 10 15 
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w 



15 



20 



30 



35 



40 



50 
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Ala Pro Ser Arg Glu Glu Ala Ala Lys Trp Ser Gin Val Arg Lys Asp 
20 25 30 

Leu Cys Ser Leu Lys Val Ser Leu Gin Leu Arg Gly Glu Asp Gly Ser 
35 40 45 

Val Trp Asn Tyr Lys Pro Pro Ala Asp Ser Xaa Gly Lys Glu He Phe 
50 55 60 

Ser Leu Leu Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys Gly 
65 70 75 80 

lie lie Ser Phe Ala Lys Val He Ser Tyr Phe 
85 90 



<210> 130 

<211> 598 

<212> DNA 

<213> Homo sapiens 



<220> 

25 <221> intron 

<222> {1) . . (66) 

'. <220> 
<221> exon 
<222> (67) . . (209) 



<220> 

<221> intron 
<222> (210) . . (410) 

<220> 

<221> exon 

<222> (411) . . (527) 

<220> 

<221> intron 
<222> (528) . . (598) 



<220> 
<221> CDS 
45 <222> (68) . . (209) 

<220> 
<221> CDS 

<222> (411) . . (526) 



<400> 130 

ctgctggtgc cggcctgtgg gctgcctccc agggagctgt cctcccctcc ccatccttgc 60 
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tgccagg gac ttg ccc ate gag gac cag ate tec ctg ctg aag ggg gec 109 
Asp Leu Pro.Ile Glu Asp Gin lie Ser Leu Leu Lys Gly Ala 
1 5 10 

get ttc gag ctg tgt caa ctg aga ttc aac aca gtg ttc aac gcg gag 157 
Ala Phe Glu Leu Cys Gin Leu Arg Phe Asn Thr Val Phe Asn Ala Glu 
IS 20 25 30 

act gga ace tgg gag tgt ggc egg ctg tec tac tgc ttg gaa gac act 205 
Thr Gly Thr Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp Thr 
35 40 45 

gea ggt geccgagaga gcctgcctgc cctggcagag ggagggaaac actgeagtta 261 
Ala 

tgggaggaag ggagctaege caggatatgc aggttctggg atggcagggc aggaagatgg 321 

aatggtggaa aacaagatat tggtgaggga tgattagatc ttggtcagct tgctgagaag , 38 1 

ctgcccctcc atyctgttac catccacagg t ggc ttc cag caa ctt eta ctg 433 

Gly Phe Gin Gin Leu Leu Leu 
50 55 

gag ccc atg ctg aaa ttc cac tac atg ctg aag aag ctg cag ctg cat 481 
Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys Leu Gin Leu His 
60 65 70 

gag gag gag tat gtg ctg atg cag gec ate tec etc ttc tec cca 526 
Glu Glu Glu Tyr Val Leu Met Gin Ala lie Ser Leu Phe Ser Pro 
75 80 85 

ggtgaggatc tcccctaggc tgectgacat cccccccagc cttatctgcc ctccccaggg 586 
aaggtcccag tc 5gg 

<210> 131 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 131 

Asp Leu Pro lie Glu Asp Gin He Ser Leu Leu Lys Gly Ala Ala Phe 
I 5 10 15 

Glu Leu Cys Gin Leu Arg Phe Asn Thr Val Phe Asn Ala Glu Thr Gly ' 
20 25 30 

Thr' Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp Thr Ala Gly 
35 40 45 

Gly Phe Gin Gin Leu Leu Leu Glu Pro Met Leu Lys Phe His Tyr Met 
50 55 60 
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' Leu Lys Lys Leu Gin Leu His Glu Glu Glu Tyr Val Leu Met Gin Ala 

65 70 .75 80 

5 lie Ser Leu Phe Ser Pro 

85 



10 



15 



25 



30 



35 



40 



45 



<210> 132 
<211> 271 
<212> DMA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (77) 



<220> 

<221> exon 
20 <222> (78) . . (183) 

<220> 

<221> intron 
<222>. (184) . . (271) 



<220> 

<221> CDS 

<222> (80) . . (181) 

<400> 132 

gagcaatgcc ctgactctgg gctggactga gcttgtcttt gccccatgat cttgcaccac 60 

acctccctcc cctccagac cgc cca ggt gtg ctg cag cac cgc gtg gtg gac 112 

Arg Pro Gly Val Leu Gin His Arg Val Val Asp 
1 5 10 

cag ctg cag gag caa ttc gcc att act ctg aag tec tac att gaa tgc 160 
Gin Leu Gin Glu Gin Phe Ala He Thr Leu Lys Ser Tyr He Glu Gys 
15 20 * 25 

aat egg ccc cag cct get cat aggtgagcac agcagggggt gaggacccgt 211 
Asn Arg. Pro Gin Pro Ala His 
30 

gagggtgatg tgagggagee gaggttcagg gaaattgece aagact-tcat ggccagaggg 271 



<210> 133 
<211> 34 
<212> PRT 
50 <213> Homo sapiens 

<400> 133 

Arg Pro Gly Val Leu Gin His Arg Val Val Asp Gin Leu Gin Glu Gin 

55 
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15 



35 



40 



45 



15 10 15 

Phe Ala lie Thr Leu Lys Ser Tyr lie Glu Cys Asn Arg Pro Gin Pro 
20 25 30 

Ala His 



10 

<210> 134 
<211> 324 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (83) 

20 <220> 

<221> exon 

<222> (84) . . (224) 

<220> 

25 <221> 3'UTR 

<222> (226) . . (324) 

<220> 
<221> CDS 
30 <222> (85) . . (225) 

<400> 134 

tgcttgtgca gcctcagagc agccctgagg cttgtgggtc agggcgggct gcacccacaa 60 

tcttttctct ggctggcatg cagg ttc ttg ttc ctg aag ate atg get atg 111 

Phe Leu Phe Leu Lys lie Met Ala Met 
1 5 

etc acc gag etc cgc age ate aat get cag cac ace cag egg ctg ctg 159 
Leu Thr Glu Leu Arg Ser lie Asn Ala Gin His Thr Gin Arg Leu Leu 
10 15 . 20 25 

cgc ate cag gac ata cac ccc ttt get acg ccc etc atg cag gag ttg 207 
Arg lie Gin Asp lie His Pro Phe Ala Thr Pro Leu Met Gin Glu Leu 
30 35 40 

ttc ggc ate aca ggt age tgagcggctg cccttgggtg acacctccga 255 
Phe Gly He Thr Gly Ser 
45 

50 gaggcageca gacccagagc cctctgagcc gccactcccg ggecaagaca gatggacact 315 

gecaagage 324 
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<210> 135 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<400> 135 



10 


Fhe 
1 


Leu 


Phe 


Leu 


Lys 
5 


He 


Met 


Ala 


Met 


Leu 
10 


Thr 


Glu 


Leu 


Arg 


Ser 
15 


He 




As.n 


Ala 


Gin 


His 
20 


Thr 


Gin 


Arg 


Leu 


Leu 
25 


Arg 


He 


Gin 


Asp 


He 
30 


His 


Pro 


15 


Phe 


Ala 


Thr 


Pro 


Leu 


Met 


Gin 


Glu 


Leu 


Phe 


Gly 


He 


Thr 


Gly 


Ser 





35 40 45 



20 

Claims 

1. A polynucleotide selected from the group consisting of: 

25 (a) polynucleotides comprising a nucleotide sequence encoding the amino acid sequence encodable by a 

nucleotide sequence of SEQ ID NO: 86, 94 and/or 98; 

(b) polynucleotides comprising a nucleotide sequence encoding a molecular variant of the cytochrome 
CYP3A4 protein by way of at least one amino acid deletion, addition and/or substitution at an amino acid 
position corresponding amino acid residue Gly56 in exon 3 of the CYP3A4 gene; 
30 (c) polynucleotides comprising a nucleotide sequence encoding a molecular variant of the cytochrome 

CYP3A7 protein by way of at least one amino acid deletion, addition and/or substitution, preferably at an amino 
acid position corresponding amino acid residue Thr 409 in exon 11 of the CYP3A7 gene; 

(d) polynucleotides comprising a nucleotide sequence of any one of SEQ ID NOS: 86, 87, 90,91, 94, 95 or 108; 

(e) polynucleotides comprising a nucleotide sequence of any one of SEQ ID NOS: 98, 99, 102, 103, 106 or 
35 107; and 

(f) polynucleotides comprising a nucleotide sequence encoding a CYP3A4 or CYP3A7 polypeptide or fragment 
thereof having an epitope comprising an amino acid sequence encoded by the nucleotide sequence of SEQ 
ID NO: 86 and/or 94. 



40 2. The polynucleotide of claim 1 . wherein said polynucleotide encodes a variant CYP3A4, CYP3A7 or hPXR protein 
or fragment thereof. 

3. The polynucleotide of claim 1 or 2, wherein said amino acid substitution comprises Gly56 to Asp in exon 3 of the 
CYP3A4 gene and/or Thr409 to Arg in exon 11 of the CYP3A7 gene. 



45 
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4. The polynucleotide of any one of claims 1 to 3, wherein the nucleotide deletion, addition and/or substitution result 
in altered expression of the variant CYP3A4, CYP3A7 or hPXR gene compared to the corresponding wild type gene. 

5. A vector comprising the polynucleotide of any one of claims 1 to 4. 

6. The vector of claim 5, wherein the polynucleotide is bperatively linked to expression control sequences allowing 
expression in prokaryotic or eukaryotic cells. 

7. A host cell genetically engineered with the polynucleotide of any one of claims 1 to 4 or the vector of claim 5 or 6. 

8. A method for producing a molecular variant CYP3A4, CYP3A7 or hPXR protein or fragment thereof comprising 

(a) culturing the host cell of claim 7; and 
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(b) recovering said protein or fragment from the culture. 

9. A method for producing cells capable of expressing a molecular variant CYP3A4, CYP3A7 or hPXR gene com- 
prising genetically engineering cells with the polynucleotide of any one of claims 1 to 4 or the vector of claim 5 or 6. 

5 

10. A CYP3A4, CYP3A7 or hPXR protein or fragment thereof encoded by the polynucleotide of any one of claims 1 
to 4 or obtainable by the method of claim 8 or from cells produced by the method of claim 9. 

An antibody which binds specifically to the protein of claim 10. 

The antibody of claim 11 which specifically recognizes an epitope containing one or more amino acid substitution 
(s) as defined in any one of claims 1 to 4. 

13. A nucleic acid molecule complementary to a polynucleotide of any one of claims 1 to 4. 

14. A nucleic acid molecule capable of specifically recognizing and cleaving the polynucleotide of any one of claims 
1 to 4. 

15. A vector comprising the nucleic acid molecule of claim 13 or 14. 

20 

16. A transgenic non-human animal comprising at least one polynucleotide of any one of claims 1 to 4 or the vector 
of claim 5 or 6. 

17. The transgenic non-human animal of claim 16 further comprising at least one inactivated wild type allele of the 
25 CYP3A4, CYP3A7 or hPXR gene. 

18. The transgenic non-human animal of claim 16 or 17, which is a mouse or a rat. 

19. A method of identifying and obtaining a CYP3A4, CYP3A7 or hPXR inhibitor capable of modulating the activity of 
30 a molecular variant of the CYP3A4, CYP3A7 or hPXR gene or its gene product comprising the steps of 

(a) contacting the protein of claim 1 0 or a cell expressing a molecular variant CYP3A4, CYP3A7 or hPXR gene 
comprising a polynucleotide of any one of claims 1 to 4 in the presence of components capable of providing 
a detectable signal in response to drug metabolization, with a compound to be screened under conditions to 

35 permit CYP3A4- or CYP3A7-mediated drug metabolization, and 

(b) detecting the presence or absence of a signal or increase of a signal generated from the drug metaboli- 
zation, wherein the presence or increase of the signal is indicative for a putative inhibitor. 

20. The method of claim 19 wherein said cell is a cell of claim 7, obtained by the method of claim 9 or is comprised in 
to the transgenic non-human animal of any one of claims 16 to 18. * 

21. A method of identifying and obtaining an CYP3A4, CYP3A7 or hPXR inhibitor capable of modulating the activity 
of a molecular variant of the CYP3A4, CYP3A7 or hPXR gene or its gene product comprising the steps of 

45 (a) contacting the protein of claim 10 with a first molecule known to be bound by CYP3A4, CYP3A7 or hPXR 

protein to form a first complex of said protein and said first molecule; 

(b) contacting said first complex with a compound to be screened; and 

(c) measuring whether said compound displaces said first molecule from said first complex. 

50 22. The method of claim 21 , wherein said measuring step comprises measuring the formation of a second complex 
of said protein and said compound. 

23. The method of claim 21 or 22, wherein said measuring step comprises measuring the amount of said first molecule 
that is not bound to said protein. 

55 

24. The method of any one of claim 21 to 23 wherein said first molecule is nifedipine, rifampicine or corttcosterone. 

25. The method of any one of claims 21 to 24 wherein said first molecule is labeled. 
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26. A method of diagnosing a disorder related to the presence of a molecular variant of the CYP3A4, CYP3A7 or hPXR 
gene or susceptibility to such a disorder comprising 

(a) determining the presence of a polynucleotide of any one of claim 1 to 4 in a sample from a subject; and/or 
5 (b) determining the presence of a protein of claim 10. 

27. The method of claim 26, wherein said disorder is cancer. 

28. The method of claim 26 or 27 comprising PCR, ligase chain reaction, restriction digestion, direct sequencing, 
10 nucleic acid amplification techniques, hybridization techniques or immunoassays. 

29. The method of any one of claims 26 to 28, further comprising administering to a subject a medicament to abolish 
or alleviate said disorder. 

15 30. The method of any one of claims 26 to 29, further comprising introducing 

(i) a functional and expressible wild type CYP3A4, CYP3A7 or hPXR gene or 

(ii) a nucleotide acid molecule of claim 1 3 or 1 4 or the vector of claim 1 5 into cells. 

20 31. A method for the production of a pharmaceutical composition comprising the steps of the method of any one of 
claims 19 to 25; and 

(c) synthesizing and/or formulating the compound identified and obtained in step (b) or a derivative thereof 
in a pharmaceutical^ acceptable form. 

25 32. A method for the preparation of a pharmaceutical composition comprising formulating a drug or pro-drug in the 
form suitable for therapeutic application and preventing or ameliorating the disorder of the subject diagnosed in 
the method of claim 26 or 27. 

33. The method of claim 31 or 32 wherein said compound drug or prodrug is a derivative of a medicament as defined 
30 . in claim 29. 

34. An inhibitor identified or obtainable by the method of any one of claims 19 to 25. 
i 35. The inhibitor of claim 34 which binds specifically to the protein of claim 10. 

35 

■ 36. Use of an oligo- or polynucleotide for. the detection of a polynucleotide of any one of claims 1 to 4 and/or for 
genotyping of individual CYP3A4, CYP3A7 or hPXR alleles. 

37. The use of claim 36 wherein said polynucleotide is a polynucleotide of any one of claims 1 to 4 or a nucleic acid 
40 molecule of claim 13 or 14. 

38. The use of claim 36 wherein said oligonucleotide is about 15 to 50 nucleotides in length and comprises the nucle- 
otide sequence of any one of SEQ ID NOS: 1 to 107 or a complementary sequence. 

45 39. A primer or probe consisting of an oligonucleotide as defined in claim 38. 

40. Use of an antibody or a substance capable of binding specifically to the gene product of an CYP3A4, CYP3A7 or 
hPXR gene for the detection of the protein of claim 10, the expression of a molecular variant CYP3A4, CYP3A7 
or hPXR gene comprising a polynucleotide of any one of claims 1 to 4 and/or for distinguishing CYP3A4 or hPXR 

so alleles comprising a polynucleotide of any one of claims 1 to 4. r . 

41. A composition comprising the polynucleotide of any one of claims 1 to 4, the vector of claim 5 or 6, the host cell 
of claim 7 or obtained by the method of claim 9, the protein of claim 10, the antibody of claim 11 or 12, the nucleic 
acid molecule of claim 1 3 or 1 4, the vector of claim 1 5, the inhibitor of claim 34 or the primer or probe of claim 39. 

55 

42. The composition of claim 41 which is a diagnostic or a pharmaceutical composition. * 

43. Use of an effective dose of a drug or prodrug for the preparation of a pharmaceutical composition for the treatment 
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or prevention of a disorder of a subject comprising a polynucleotide of any one of claims 1 to 4 in its genome. 
44. The use of 43 wherein said disorder is cancer. 
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CYP3A4 polymorphisms 

Exon 3 (G235A; Gly56Asp) 



TC C CH C HC TTTTCT 




wt/mut 

Oligonucleotide 3A450F (forwards) 



Intron 7 (T-+G) * 



TAT'CTTGCTCT'CTT 




wt/mut 

Oligonucleotide 3A433F (forwards) 



A CYP3A7 exon 11 polymorphism (C1229G; Thr409Arg) 



TACT GGACAGjIGC.C 




wt/mut 

Oligonucleotide 3A742F (forwards) 
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hPXR polymorphisms 



Exon 1b (C149A, untranslated) 
wt/mut 




Oligonucleotide hPXR57F (forwards) 



Exon 5/6 (CC975TT, silent) 
wt/wt 

ACAGTGGCGGGA AA 




Oligonucleotide hPXR32F (forwards) 
(forwards) 



rriut/mut 



CA-GTGG.TGGGA&.AG 




Oligonucleotide 



hPXR32F 



Intron 7 (C-T) 
wt/mut 

CCtCCAT-HCfGTTAC 




Oligonucleotide hPXR36F (forwards) 



Fig. 6 
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Fig. 8 

CYP3A4 

Exon 3 (G235A Gly56Asp) 

CCTCTAACTfir.CAnCAAGTCTG ATTTCATTGGCTTCGACTGTTTTCATCCCAATTAGAGG 
CAGGGTTMGTACATTAAAAATMTMTCAMTATTATTTTGTTTCTCCTCCCASGG^C 
TTTTGTATGTTTGACATGGAATGTCATAAAAAGTATGGAAAAGTGTGGGGGJGAGTATT 

CTGGAAACTTCCATTGGATAGACTTGTTTCTATGATGAGTTTACCCCACTGCACAQAGJiA 
CAGTCTCAGCCC 

Exon 4 

AGTCTGGCTTCCTGGGTTGGGGTC CAGCTGTAGAATAAGGCTGTTGATGTTTAATCAAC 
TCTG I I I I I I I CACACAGCTTTTATGATGGTCAACAGCCTGTGCTGGCTATCACAGATC 
CTGACATGATCAAAACAGTGCTAGTGAAAGAATGTTATTCTGTCTTCACAAACCGGAG 
GGJIAAGCATTCATGTGTTGAAATTAAAATACTGATTGATTAAATTTATATTTTGAAATTCTT 
ATATATTnATAGACAGTTGCCTAAAAAA TGTCCAGGAAGGTTCCACGTCCACTTC 

Exon 5 

CTACAACCATGGAGACCTCC ACAACTGATGTAGGACAAAATGTTTCTGCTTTGAACTCT 

AQCCTTTTGGTCCAGTGGGATTTATGAAAAGTGCCATCTCTATAGCTGAGGATGAAGA 

ATGGAAGAGATTACGATCATTGCTGTCTCCAACCTTCACCAGTGGAAAACTCAAGGA 

GGTATGAAAATAACATGAGTTTTAATAAGAAACTTAAAGAA TGAATCTGGTGGGGACAG 
GTA 

Exon 7 (T/G intron 7 polymorphism) 

GTCTGTCTTGACTGGACATGTGG CTTTCCTGATGCACGCATAGAGGAAGGATGGTAAA 

AAGGTGCTGATTTTAATTTTCCACATCTTTCTCCACTCAQCGTCTTTGGGGCCTACAGC 

ATGGATGTGATCACTAGCACATCATTTGGAGTGAACATCGACTCTCTCAACAATCCAC 

AAGACCCCTTTGTGGAAAACACCAAGAAGCTTTTAAGATTTGA I I I I I I GGATCCATTC 

TTTCTCTCAATAAGTATGTGGACTACTATTTCCTTTTATTTATCTT T^G CTCTCTTAAAAA 

TAACTGCTTTATTGAGATATAAATCACCATGTAATTCATCCACTTAAAATATACAGTTCAG 

TGATTTGTAGTACATT TGAAGATATGTGTGACCATCATC 
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Fig; 8 (continued) 

Exon 9 

GGAGATCAAGGACCACGCTTGTG ATTTACTTCTGACTTCAGGAGCCACTTTCTGTCAGT 
GAAATTTCTC Nil rGCTTCTAQCACCGAGTGGATTTCCTTCAGCTGATGATTGACTCTC 
AGAATTCAAAAGAAACTGAGTCCCACAAAGCTAACCAGAGTGTTTCTGAGGGCTACTT 

^Tf^r;^r;nArTnAnAr;nGAAGGCCTTGTTCTGAAAATGT GCAGGAAGTATTCCAGGATn 
ATGAG 

CYP3A7 ' 

Exon 11 (C1229G Thr409Arg polymorphism) , 

CCAGTATGAGTTGTTCTCTGG AACTTCTAACAGTTCAACAGTACTACATGGACTGAGTTA 

AAAGTTAATTGAAAAATCTCAATTTATCCAAATCTGTTTCTTTCTTTT 

CCTATGATACTGTGCTACAGTTGGAGTATCTTGACATGGTGGtGAATGAAACACTCAG 

ATTATTCCCAGTTGCTATGAGACTTGAGAGGGTCTGCAAAAAAGATGTTGAAATCAAT 

GGGATGTTTATTCCCAAAGGGGTGGTGGTGATGATTCCAAGCTATGTTCTTCATCATG 

ACCCAAAGTACTGGAC^>GAGAGCCTGAGAAGTTCCTCCCTGAAAGGIAGGAGGCCC 

CTGGGAAGGGAGCCCTCCCTGAACC AGCCTGGTTCAAGCATATTCTGCCT 



hP XR 

Exon 1b&1a {in bold in exon 1b: nn 1-280 of hPXR cDNA (GenBank AF061 056.1), 
dotted: nn 36-257 of PARI cDNA (GenBank AF084645)) 

TCAAGTGCTGGACTTGGGAC TTAGGAGGGGCAATGGAGGCGnTTAGTGnrTArATrTri 

ACTTGGACtGAAATATAGGTGAGAGACAAGATTGTCTCATATCCGGGGAAATGATAAC 

CTATGACTAGGACG 

P.AGCCTGC^ 

GIQCTGAA<^=>^^ 

CCG.GAn.Gn.CAAA.^ 

GCAGGIATGGTTTTTCT^ 

CCAAAAGATGTGTGTGAACACAAATATACCTTCTGTT 
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Fig. 8 (continued) 

T GAGGTCAGCATCATAGtGGG TCGTGAATCATGTTGGCCTTGCTGCTGTCTCCTCATTT 

CTAGGGTGAAAAAAAAAAAGCATGAAAACAATCACTTAATGTTGAGCCCCATTA 

CTGATGCTCTCTGGTCCTGCACTAGCCTCCTAGAAAAATCACCACAGCCTTAACTACTG 

n ^iT rz A ^tt A r.r Ar; A A^TP. AP.AP. ATAHAACG AGCTCCCTGTTAC AGGGCTGG AGTO.nr.yn 

GACCCAGGAAATACCACCTCCAAGGACTGTGGGAGCTGGGGACTATGGGAACTGGGAT 

CAACTCAGTCCTGATTCCTTTTGGCCTGCTGGGTTAGTGCTGGCAGCCCCCCTGAGGC 

CAAGGACAGCAGCATGACAGTCACCAGGACTCACCACTTCAAGGAGGGGTCCCTCAG 

AGCACCTGCCATACCCCTGCACAGTGCTGCGGCTGAGTTGGCTTCAAACCASIGAGTT 

TTP.TAPPTPTAPTATTaAAAnnnCACCTTGTCCCACAGAACC GAGTCTTGCCTGCATGT 

£S 

EXON 2 

CTGAGGCGTCTACACATC CCTGTCCAGTCTTTTCATTCTCTGTGGTTTTCTCATTTCTAGT 

CCAAGAGGCCCAGAAGCAAACCTGGAGGTGAGACCCAAAGAAAGCTGGAACCATGC 

TGACTTTGTACACTGTGAGGACACAGAGTCTGTTCCTGGAAAGCCCAGTGTCAACGCA 

GATGAGGAAGTCGGAGGTCCCCAAATCTGCCGTGTATGTGGGGACAAGGCCACTGGC 

TATCACTTCAATGTCATGACATGTGAAGGATGCAAGGGCTTTtTCAGSIAGAGTTACCC 

ATnAnPPTTPACCCACGTGCCACCACTGACCCACTG GGTAACATCTCAGGGCCT 

Exon 3 

AGGTGGTATGGCCCGGAGC CCCAGGCCGAGGGCCCGGGCACCCGTGCATCCCCCCTT 
CTGCTCCCCATTCTCTCACASGAGGGCCATGAAACGCAACGCCCGGCTGAGGTGCCC 
CTTCCGGAAGGGCGCCTGCGAGATCACCCGGAAGACCCGGCGACAGTGCCAGGCCT 
GCCGCCTGCGCAAGTGCCTGGAGAGCGGCATGAAGAAGGAGAGTGAGCAGTGGGCG 
CGCGGGCGGGCCGGCGCCGGGGTGCACGGCTCTGAGTAAGGACGTGCCGTGGGTGT 
GTGCATGCTTGTGTGGAGATGCGCGCCGAGTGTGCGCGTGAACACACQISCACAISIQ 
AQCT 

EXON 4 . . . 

CTAACGGCTTCTGCTGCC TTGAGAGGGTTACACAGTGGCTCTCCAGGGGGCTGGAGGC 
TCACCAGGGGCACGTGTGCCTGAGCCAGCCTCACTGTCCCTGCAGTGATCAT 
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Fig. 8 (continued) 

GfCCGACGAGGCCGTGGAGGAGAGGCGGGCCTTGATCAAGCGGAAGAAAAGTGAAC 
GGACAGGGACTCAGCCACTGGGAGTGCAGGGGCTGACAGAGGAGCAGCGGATGATG 
ATCAGGGAGCTGATGGACGCTCAGATGAAAACCTTT6ACACTACCTTCTCCCATTTCA 

AGAATTTCCGGGTAGGAGGAACTGCACAGTGACCCGAGGTGTCACTGCCATCTTCATT 
r/TP.APATAGA AACTGAGGTTCCCCAAGGA 

Exon5/6 

CTGAGTTGGGACCTGTCT ATGAAAGCACATGCTGTCTCTCCTCTGTCCACCTCCTGGCA 

TGTGTCCTASCTGCCAGGGGTGCTTAGCAGTGGCTGCGAGTTGCCAGAGTCTCTGCA 

GGCCCCATCGAGGGAAGAAGCTGCCAAGTGGAGCCAGGTCCGGAAAGATCTGTGCT 

CTTTGAAGGTCTCTCTGCAGCTGCGGGGGGAGGATGGCAGTGTCTGGAACTACAAAC 

CCCCAGCCGACAGTGGC^IGGGAAAGAGATCTTCTCCCTGCTGCCCCACATGGCTGA 

CATGTCAACCTACATGTTCAAAGGCAfCATCAGCTTTGCCAAAGtCATCTCCTACTTCA 

GfiTAGGACATGGAGACTGGGTGGTTGGGTGTGGAAAAGAACTGGAAGTGGCCAGfiAQ 

GTTCAAAGGGCCTGG . 

EXONS 7&8 

CTGCTGGTGCCGGCCTGT GGGCTGCCTCCCAGGGAGCTGTCCTCCCCTCCCCATCGTT 

GCTGCCAQGGACTTGCCCATCGAGGACCAGATCTCCCTGCTGAAGGGGGCCGCTTTC 

GAGCTGTGTCAACTGAGATTCAACACAGTGTTCAACGCGGAGACTGGAACCTGGGAG 

TGTGGCCGGCTGTCCTACTGCTTGGAAGACACTGCAGSIGCCCGAGAGAGCCTGCCT 

GCCCTGGCAGAGGGAGGGAAACACTGCAGTTATGGGAGGAAGGGAGCTACGCCAGGA 

TATGCAGGTTCTGGGATGGCAGGGCAGGAAGATGGAATGGTGGAAAACAAGATATTGG 

TGAGGGATGATTAGATCTTGGTCAGCTTGCTGAGAAGCTGCCCCTCCAT C-»T CTGTTAC 

CATCCAGAgGTGGCTTCCAGCAACTTCTACTGGAGCCCATGCTGAAATTCCACTACAT 

GGTGAAGAAGCTGCAGCTGCATGAGGAGGAGTATGTGCTGATGCAGGCCATCTCCCT 

CTTCTCCCCAGGTGAGGATCTCCCCTAGGCTGCCTGACATCCCCCCCAGCCTTATCTG 

CCCTCC CCAGGGAAGGTCCCAGTC 

EXON9 

GAGCAATGCCCTGACTCT GGGCTGGACTGAGCTTGTCTTTGCCCCATGATCTTGCACCA 
CACCTCGCTCCCCTCCASACCGCCCAGGTGTGCTGCAGCACCGCGTGGTGG 
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Fig. 8 (continued) 

ACCAGCTGCAGGAGCAATTCGCCATTACTCTGAAGTCCTACATTGAATGCAATCGGCC 

CCAGCCTGCTCATAGSIGAGCACAGCAGGGGGTGAGGACCCGTGAGGGTGATGTGAG 
GGAGCCGAGG1TCAGGGAAATTGCCCAA GACTTCATGGCCAGAGGG 

EXON9A 

TGCTTGTGCAGCCTCAGA GCAGCCCTGAGGGTTGTGGGTCAGGGCGGGCTGCACCCA 

CAATCTTTTCTCTGGCTGGCATGC^GTTCTTGTTCCTGAAGATCATGGCTATGCTCAC 

CGAGCTCCGCAGCATCAATGCTCAGCACACCCAGCGGCTGCTGCGCATCCAGGACAT 

ACACCCCTTTGCTACGCCCCTCATGCAGGAGTTGTTCGGCATCACAGGTAGCIGAGC 

GGCTGCCCTTGGGTGACACCTCCGAGAGGCAGCCAGACCCAGAGCCCTCTGAGCCG 

CCACTCCCGGGCCAAGACAG ATGGACACTGCCAAGAGC 
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